CGTTATCAAA CTCATTACCA ATTGAAACAA AAAACTGTGG TTAGAGCCTT TCGGAAATCG 7560

TCAAGCGATT GGAGGAAATG AACTAATCCA CAGTGGCTTA TTCCAAGTAT ACCACTTGGG 7620

CTTTGGCAGT AGCTAACTGC GCTAAATATA ATATAAGGAG AAATAGATGG ATTTATGGTT 7680 

TTCTGAAGTT CATACTCCAG ATGTCAAATT GTCTCTGAGA ACAGCCAAGC AACTTTACGC 7740 

TGGAAAAAGT GAATGGCAGG ATATCGAAGT CTTGGATACG CCAGCTTTTG GGAAAATACT 7800 

GATTTTAAAT GGCCATGTCT TGTTCTCAGA TGCGGATGAT TTCGTCTACA ATGAAATGAC 7860

CGTTCACGTT CCCATGGCTG TCCACCCAAA TCCAAAGAAA GTATTGGTTA TTGGGGGTGG 7920 

TGACGGCGGT GTTGCCCAAG TATTAACCCT CTATCCTGAA CTGGAGCAAA TTGATATTGT 7980 

GGAACCGGAT GAGATGTTGG TCGAGGTCTG TCGTGAGTAT TTCCCAGACT TTGCTGCAGG 8040 

GCTAGATGAT CCTCGTGTTA CCATTTACTA CCAAAATGGG CTACGCTTTT TGCGAAACTG 8100 

CGAAGATGAT TACGATATTA TCATCAACGA TGCGACAGAT CCATTTGGCC ATACGGAAGG 8160 

ACTCTTTACC AAGGAATTCT ACGGCAATAG TTATCGAGCT CTGAAGGAAG ACGGCATCAT 8220

GATTTACCAG CATGGGAGTC CCTTCTTTGA CGAGGATGAG TCGGCCTGCC GAAGCATGCA 8280

CCGCAAGGTC AATCAAGCCT TTCCAATCAG TCGGGTTTAT CAGGCCCATA TTCCAACTAG 8340

CCCAGCTGGC TATTGGTTGT TTGGATTTGC ATCGAAAAAA TACCACCCTG TCAAAGATTT 8400 

TGACAAGGAA GGCTGGAAAA AACGCCAGCT TTTCACAGAA TACTACACTG CAAACTTACA 8460

CGTGGGAGCC TTTATGTTGC CCAAGTATGT TGAGGACATT TTAGAAGAAG AGGAAGGAAA 8520 

AAAATGAGTC GTTTACTAGT TATTGGTTGT GGGGGCGTTG CCCAAGTTGC TATTTCAAAG 8580

ATTTGTCAAG ATAGCGAAAC ATTTACAGAG ATTATGATTG CTAGCCGTAC CAAGTCAAAA 8640

TGCGATGACT TGAAAGCGAA GCTAGAAGGC AAAACAAGTA CTAAAATTGA AACTGCAGCA 8700 

CTTGATGCTG ACAAGGTTGA AGAAGTGATT GCCCTGATTG AAAGCTACAA ACCAGAAGCT 8760

GTTTTGAATG TAGCTCTGCC TTATCAAGAT TTAACCATTA TGGATGCTTG TTTGGCAACA 8820

GGTGTTCACT ATATCGATAC AGCCAACTAC GAAGCAGAAG ACACAGAAGA CCCTGAGTGG 8880 

CGTGCTATCT ACGAAAAACG TTGTAAGGAA CTTGGTTTTA CAGCCTACTT TGACTACTCA 8940 

TGGCAGTGGG CTTATCAAGA GAAATTCAAA GAAGCAGGCT TGACTGCTCT TCTTGGTTCT 9000 

GGTTTTGACC CAGGTGTAAC TAGTGTCTTT TCAGCTTATG CCCTCAAACA CTATTTTGAT 9060 

GAAATCCATT ATATCGACAT TTTAGACTGT AATGGCGGTG ACCACGGTTA TCCATTTGCA 9120 

ACCAACTTTA ATCCAGAAAT TAATCTCCGT GAGGTTTCTG CGCCAGGTTC TTACTGGGAA 9180 

GATGGGAAAT GGGTCGAAGT CGAAGCTATG TCTATCAAGC GTGAGTATGA TTTCCCTCAA 9240 

GTTGGACAAA AAGATATGTA TCTCCTTCAC CATGAAGAAA TCGAATCATT GGCCAAGAAC 9300 

ATTCCAGGTG TCAAACGCAT TCGTTTCTTT ATGACTTTTG GTCAATCTTA CTTGACGCAC 9360

ATGAAATGTC TTGAAAATGT TGGACTCCTT CGTACGGATA CCATTAACTT TAACGGCCAA 9420

GAAATTGTTC CAATTCAATT TTTGAAAGCC TTGCTTCCAG ATCCTGCCAG TCTTGGGCCA 9480

CGTACAGTCG GAAAAACCAA TATTGGATGT ATCTTTACAG GTGTCAAAGA CGGTGTCAAA 9540 

AAGACTATCT ATATCTACAA TGTCTGCGAC CATCAGGAAT GTTACGCAGA GGTTGGTTCG 9600 

CAAGCTATTT CTTATACGAC AGGAGTTCCA GCCATGATTG GGACAAAATT AGTCATGAAC 9660 

GGAACTTGGA AACAAGCTGG AGTGTATAAC CTTGAGGAGT TAGATCCAGA TCCATTCATG 9720

GAAGCTTTGA ATGAGTATGG TTTGCCATGG GTTGTGGTTG AAAATCCACA AATGGTGGAC 9780

TAATGAAGTT AGAACAAGTA CCAACACCAG CCTATGTTAT TGACTTGGCC AAGTTAGAAG 9840 

CTAATTGCCG CATTCTACAA TATGTACAAG AAGAGGCCGG TTGCAAGGTC TTGCTTGCCC 9900

AGAAGGCATA TTCCCTCTAC AAAACTTATC CCTTGATTAG CCAGTATCTA TCAGGTACGA 9960 

CAGCTAGTGG ACTCTATGAG GCCAAATTGG CAAGGGAAGA ATTTCCTGGT GAAGTCCATG 10020

TATTTGCGCC TGCTTTCAAG GATGCAGACT TGGAGGAATT GCTAGAGATA ATGGACCATA 10080 

TAGTCTTTAA CTCAGAGAGA CAGTTGCGTA AACACGGTCC GCGTTGTCGA GAGGCTGGTG 10140 

TCAGTGTTGG TTTGCGCCTC AACCCTCAGT GTTCAACTCA AGGcAGATCA CGCGCTCTAT 10200 

GACCCTTGTG CACCAGGTTC TCGCTTTGGA GTTACTATAG ACAAGATTCC GAGTGATTTG 10260 

CTAGATTTGG TTGACGGACT TCATTTTCAT ACCCTTTGCG AGCAGGGAGC AGATGATTTA 10320 

CAAACAACTT TGAAAGCAGT AGAAGAACAG TTTGGTCCCT ACTTACATGA GGTAAAATGG 10380 

CTCAATATGG GTGGTGGTCA TCATATTACA AGAGAAGGTT ACGATGTGGA TTTGCTGATT 10440

TCAGAAATCA AGCGTATCCG AAAAACTTAC AATCTTGAAA TCTATATCGA GCCTGGTGAA 10500 

GCCATTGCGC TTAATGCGGG TTATTTAGCA ACTGAGGTAT TAGATATTGT AGAAAACGGT 10560 

ATGGAAATCT TGGTTTTAGA CGCCTCTGCG ACCTGCCATA TGCCTGATGT ACTTGAGATG 10620 

CCCTATCGTC CACCTTTGAG AAATGGCTTT GAGTCACAGG AAAAAGCCCA TACCTACAGA 10680 

CTTTCTTCTA ATACCTGTCT GACGGGCGAT GTGATTGGTG ATTATAGTTT TGAAAATCCA 10740 

GTCCAAATCG GAGACAGACT TTATTTTCAA GACATGGCCA TTTATTCTTT TGTCAAAAAT 10800 

AATACCTTTA ATGGTATTGG ATTGCCAAGT CTCTATCTCA TGGACGAACA GGGAGACTGT 10860 

AGCTTACTCA AAGCTTTTGG CTATCAAGAC TTTAAAGGGA GATTATCATG ATGGACAGTC 10920 

CAAAAAAATT AGGCTATCAC ATGCCAGCAG AGTACGAACC CCATCATGGT ACCCTCATGA 10980 

TATGGCCGAC TCGACCAGGA TCATGGCCTT TTCAAGGAAA GGCTGCTAAA AGAGCATTTA 11040 

CTCAGATTAT CGAGACCATA GCAGAAGGGG AAAGAGTCTA TCTTTTGGTG GAGCAGGCCT 11100

ATCTATCTGA AGCCCAATCC TATCTTGGAG ACAAGGTTGT TTATTTAGAC ATTCCCACCA 11160 

ATGATGCCTG GGCGCGTGAT ACTGGCCCAA CCATTCTCGT CAATGATAAA GGTAAGAAAT 11220

TAGCCGTGGA TTGGGCCTTC AATGCTTGGG GAGGCACCTA TGATGGTCTT TATCAAGATT 11280 

ATGAAGAGGA TGACCAAGTA GCCAGTCGTT TTGCTGAGGC CTTGGAAAGG CCTGTCTATG 11340 

ATGCTAAACC TTTTGTACTG GAAGGAGGCG CAATCCATAG CGATGGTCAA GGAACTATTC 11400 

TCGTAACTGA AAGTTGCTTG CTTAGTCCTG GTCGCAATCC TAACTTGACT AAAGAGGAGA 11460 

TTGAAAACAC ATTATTAGAA AGTCTTGGTG CTGAAAAAGT TATTTGGCTT CCTTATGGTA 11520 

TTTATCAGGA TGAAACCAAT GAACACGTCG ATAATGTTGC TGCCTTTGTT GGTCCTGCTG 11580 

AGCTTGTTTT GGCTTGGACA GATGACGAAA ATGATCCCCA GTATGCCATG TCAAAAGCAG 11640 

ATCTCGAACT CTTAGAACAG GAAACAGATG CAAAAGGTTG TCACTTCACC ATTCATAAAT 11700 

TGCCTATCCC TGCAGTTCGA CAAGTTGTGA CAGAAGAAGA TTTGCCAGGC TACATCTATG 11760 

AAGAAGGAGA AGAAAAGCGA TACGCAGGTG AACGACTAGC AGCTTCCTAC GTAAACTTTT 11820 

ATATCGCCAA CAAGGCTGTC TTGGTTCCAC AGTTTGAGGA TGTAAACGAC CAAGTGGCCT 11880 

TAGATATCCT CAGCAAGTGT TTCCCAGACC GTAAAGTTGT CGGAATACCA GCCAGAGATA 11940 

TTCTCTTAGG TGGTGGCAAT ATCCACTGTA TCACCCAACA AATTCCAGAA TAGGAGAAAA 12000

AGATGAGAAA TGTAAGAGTT GCAACCATTC AGATGCAATG CGCTAAGGAT GTGGCAACAA 12060 

ATATCCAAAC CGCAGAGCGT TTAGTACGTC AGGCTGCTGA GCAAGGAGCC CAAATTATTC 12120 

TCTTGCCCGA GTTGTTTGAA CATCCCTATT TCTGTCAGGA ACGTCAGTAT GACTACTACC 12180 

AGTATGCCCA ATCTGTAGCG GAAAATACTG CCATTCAGCA TTTTAAGGTG ATTGCTAAGG 12240 

AACTACAAGT TGTTTTACCA ATCAGTTTCT ATGAAAAAGA TGGTAATGTC TTGTATAACT 12300 

CTATTGCCGT CATTGATGCA GATGGGGAAG TGCTGGGCGT TTATCGAAAG ACCCATATAC 12360

CAGATGACCA TTATTATCAA GAAAAATTCT ATTTCACGCC TGGTAACACT GGTTTCAAGG 12420 

TCTGGAATAC TCGCTATGCT AAGATTGGTA TCGGTATCTG TTGGGATCAA TGGTTCCCTG 12480 

AAACAGCGCG CTGTCTTGCA TTGAATGGTG CTGAATTGCT CTTTTATCCT ACAGCTATCG 12540 

GTTCAGAGCC AATTTTGGAT ACAGATAGTT GTGGTCACTG GCAACGTACT ATGCAAGGGC 12600

ACGCAGCAGC GAATATTGTT CCAGTCATCG CAGCCAATCG TTATGGTTTA GAGGAGGTTA 12660

CTCCTAGTGA GGAAAATGGC GGACAGAGCT CCAGTCTTGA CTTCTACGGT TCCTCCTTTA 12720 

TGACGGATGA AACAGGAGCT ATTCTAGAAC GAGCTGAAAG ACAAGAAGAA GCTGTTCTGT 12780 

TAGCTACTTA TGACCTAGAC AAGGGAGCAA GTGAACGCCT AAACTGGGGC TTGTTTCGAG 12840 

ATAGAAGACC AGAAATGTAT AGACAAATTA CAGATTAGTG TGGGAGAAAT GAGAGATTCA 12900

TTCTGCTAGA CTAACTTCTT ATTAGTAACT ATAAGATACT ATGGCATCTA GTAAATCGAT 12960

TTTTATGATT CGCTATTCTT GTCTATTGAT TAGTCCGTAT TTTAAAATAT TAGCAAAAAA 13020

GCAAATAGCA GTAACTTCTG TCTATTTGCT TTTCTTTTTT ATAGAATATA TTTCTCAATA 13080 

GCACGCGCAA CGCCGTCTTC TTCGTTGCTT GAGGTAACGG CATCCGCAAG AGATTTGATA 13140 

TAATCGCTGG CATTTCCCAT TGCAATCCCA AGCCCTGCAA ACTGGAGCAT TTCGATATCG 13200

TTATTAGCAT CGCCCATGGC CATAATCTCT GAGGAATCAA TCTTCAAAAT CTCAGCTAGT 13260

CGTGAAAGAG CAGTAGCCTT TGTCGTTCCA AGCGGCATTG CTTCATAAAT GACAGGCTGC 13320

GAACGAACTC CACTGAATCG TTGGCAAAGC TCTTCAGCAA AACGCTGCTC AAAATCGTCT 13380

GTTTGTTCTT TTGTTCCTAA ACACATACCT TGGAACATCC GGAACTTTCC ACTAGTCGCT 13440 

TCTTCAAGAG AAATTTCAGT CAGGTCTGAA AATACTAGTT TAGCATCATT TTCAATAACT 13500

TGATTGGGCT TGTCACCGAG AACAAAATAA TGTGACTCGT CAAAAAGTGT CAACTGAACA 13560

TCACTCTTTT CAGCAAGGTC ATAGAGGTAT TCGATGTCAG CTGGACTCAG TTCTTTCCAG 13620

TCAACTAGAC TCCAATCACT GGTCTGGTGA GTTGAACAAC CGTTGTTAAC AATAATATAT 13680

TCGTTCTGGA GGTCAAGCTC CAGTTTTTTG TAGTAGGGGA GGACACCGAA AAGGGGGCGA 13740 

CCCGTACAGA GAACCAGTTT GACACCTTTT TCAATGGCTT TGTGAATAGC AGTAATGTGT 13800

GCTTGTGGGA TTTCCTTGGC TTCATTGAGG AGGGTGCCGT CCATATCCAA GGCTAGTAGT 13860 

TTAATCATAG GTCTTCCTCT TTATCTTTGC TATTATTATA GCATATTTTG GAGAAGAAAT 13920 

TGATAGAAAG CTTGAGACTA ATTGATTTTA TAGTTTAAGA TGTTTTGATG ACAATTCATG 13980

ATTTGAAGAG GATATTTCGC AAAGATATGC TATACTATGT TTGTCAATGT TGCAACTAGA 14040 

CAAATTAAAA AACCAACTTA ATATAATAGT TTTTTTGTAA GTAGGTATGA GTAGCAGATT 14100 

ACTCAACTAA TCTGAAGAAT AATGGAGGAA ATATATCATG ATTTTAATGA CAAAAAATAT 14160 

AAATCTAACA AATGAAGAAT TAGAGCTGAT ACAAGGTGGA GCAGATCCAT ATGGTAAAAA 14220

TCCTAATGGT AGGTACGATT GGGAAATAGA ACCAGTATTA ACTCTGCTGG TTCATGGATT 14280

TTGTCCCAGA GGCACCTATG ATTCAGGATA TATTGGAGGA GGTAATCATC TTTGCAAAGG 14340 

AAGTGCTGCG AGATTTTAAG TAAAATTTAT TAGGAATATG AAGAAACAAG GGGAGAAAAC 14400 

AGAGGATTTA ATATGAAAAA ACGAGCTATT CAAATTTTAC TAGCATTGTC CTTAATTTTT 14460

TACAAATCAA CTTGGTTTTG GAGGCTTTTC AATTATCTCG CAAAGCCCTA TCTACCAGCA 14520

AGTCGTGAAT TTTTTCAGAT TCTGCTTTTG ATGGAGAGCG GAGTTCTTTT CTTAGCGGTC 14580 

ATCTATCTAC TGGTTTTTGC AGGAAAGAAA ATTTTTCATT TCAAGTGGCA GCTGAGGTAC 14640

TTCATCTACC TTTTACTGGG CTACATCATT TCATATATGT CTGACTTCCT CTTTTCGTAT 14700 

TTCATATCCC TGTCTTCAAA TCAGATTTCT TTGAATGAAA CGGTAGAAAT GATGGGGAGA 14760

CAGGAGTTCC CTTATGTCTT GCTCATCGTT TGCTTCATCG CCCCTATTGC TGAGGAATTG 14820

ATTTATCGAG GtGTGCTTAT GACAACCTGT TGCAAAAACT CACCTTGGTA CG 14872 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10223 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

CGTGCTATCG GTCTCAAAAC CAATCTGGTC GCTATGGTCA AATCCAGTTG GAAAATCCAT 60 

TCTTCTTGGA GCCATCTGCT GGATTGCCAT CATCCTCACC ACTCTTGGTA TGCAGACCCT 120 

TATCGGCATT TTCTAATACT CTTCGAAAAT CTCTTCAAAC CACGTCAACG TCGCCTTGCC 180 

GTAGGTATAT GTTACTGACT TCGTCAGTTC TATCTGCAAC CTCAAAACGG TGTTTGAGCT 240 

GACTTCGTCA GTTCTATCTG CAACCTCAAA ACGGTGTTTT GAGCTGACTT CGTCAGTCGT 300

ATCTACAACC TCAAAACAGT GTTTTGAGCT GACTTCGTCA GTTCTATCTG CAACCTCAAA 360 

ACAGTGTTTT GAGCAGCCCG TGGCTAGTTT CCTAGTTTGC TCTTTGATTT TCATTGAGTA 420 

TAACACAAAA GGTAGCCCAT CAGCTACCTT TTTCTTATGC TTCCTCAATC AAGCGAGTAT 480

GTTCTCTCTT GATACAGCGA TTCATCACGA TATCATCACA TCCACCATCA CGCAAAATCT 540 

CTTTCGCTTC TAAACTTTCA AGTCCTAGCT GTGCCCAAAA AATCTTGGCA TCAGCTTTGA 600

GAAAATCACG CGCCACATCG GGCAGAAATT CACTGCGACG ATAAACATTG ACAATATCTA 660

CAGGAAAAGG AATTTCAGCG AGGCTAGCAT AAGCCTTTTC ACCCAAGATT TCGCCACCTG 720 

CCGCCTTGGG ATTGACTGGG ATGATTTTAT AGCCCCGAGC CTGCATTTCC TTTGTTACTC 780

GATTGCTGGT TGTTTCTTCA CGGTCAGACA AACCCACCAC AGCAAGGGTT TTACTCGTTG 840 

CGAGATACTG ACGAATCACG CCATCACTTG GATTGATAAA TTCTTGACTC ATAGAAATCC 900 

TCCTTTTTCA TCAGTATAGC ACATTTTGAA AAGGTTTGCA GAATTATACT ACAAAAAAGG 960 

AGGACTAGCC CCCTTTTTAT TTAGCCTCGT ACCAGGTTGC CCCTTCATTC TCATCTGCGA 1020

TAAGAGGAAC ACTGAGTTGA ATGGCTTCTT CCATGGTTTG TTTCACCAAT TTTTTCATCT 1080 

CTACCAATTC AGATTTAGGC ACTTCAAGGA CGATTTCATC GTGCACTTGT AACAGCATCT 1140

TAGTCTGATA ACCACCTGCA ACCAAGGCTT TATCCAGCTG AATCATGGCA ATCTTGAGAA 1200

TATCTGCTGC CGAACCCTGG ATAGGTGAGT TGATAGCAGT TCGCTCCGCA AAACCACGAA 1260

TATTGAAGTT GCGCGAATTG ATATCTGGCA ACTCACGGCG ACGCTTAAAG AGGGTCTCTA 1320

CATAGCCCTT ATCACGCGCC TCCCGCACCA CTTCATCCAT GTAGTTTTTA ATACCTGGAA 1380

AACGTTCAAA GTAGGTATCA ATGTAGGCTT TGGCTTCCTT ACGACTAATT CCCAAATTAT 1440 

TAGACAAGCC AAAGTCTGAA ATCCCATAAA CCACTCCAAA GTTAACTGCC TTGGCATTGC 1500

GACGGTCGTT TGCAGTCACA TCATCAGGAC GCTCAATGCC AAAGACCCGC ATGGCTGTCG 1560

AAGTATGGAT ATCTGCCCCC TCTTGGAAGG CCTTAATCAA GTGCTCATCC TTAGAAATAT 1620

GCGCCAAAAC GCGCAATTCA ATCTGTGAAT AGTCAGAGCT GAGTAGCACA CTATCCTCCC 1680 

ACTCTGGCAC AAAAGCCTTC CGAATCAAGC GCCCCTGTTC CAATCGGGCA GGAATATTTT 1740 

GCAAGTTTGG ATCCACACTA GACAAACGCC CGGTCTGGGT CAAATCCTGC ACATAGCGAG 1800 

TATGAATCTT TCCATCAGCC AAAATCCAGT CCTGCAAGCC AATTACATAA GTAGATTGAA 1860 

TCTTAGCAAT TTGACGGTAA TCCAGGATTT TCTTAACAAT CGGAGCAATA GGAGCGAGAC 1920

GCTCTAAAAC ATCCACTGCT GTCGAATAAC CTGTCTTGGT TTTCTTAGTG TATTCTAGAG 1980 

GAAGTCCCAA TTTCTCAAAG AGAAGCACGC CCAACTGCTT AGGCGAGTTG ACATTAAACT 2040 

CCTCACCAGC CAGCTCGTAA ATCTCTTGAG TCAGTTTTTC AATGACAAGC TCATTTTCAG 2100 

CCTGCATCTC AAGCAAGGTC TCTTTCTTGA CCATAATCCC AGCAATTTCC ATCTTGGCAA 2160 

GGACAAAAGC CAGAGGTTGC TCCATATCAT AAAGAAGCTC TAATTGCCCA TTTTCGCTGA 2220

GTTTTTCAAG TAAAATAGGC TCTGTTTCTA CCAAAACAGC AAGTTTACAA GCTAAGTGTT 2280

CCAAGAATTT CTCACGTTCA GGAATGGCCT TTTTAACACC CTTACCGTAG AAAGTTTCAT 2340

CATCAACCAA GTAAGTCTGA CCATAAAGAC TAGCGATGGT CGCAATTTCA TTGTCCTCCA 2400 

CAGTCGAAAG GAGGTATTTA GCCAAACGGA TGTCAAAAGC AGGCGCCTGC AAATCCACAC 2460

CAAAACGTTG CAAAAGAACT TTAACCTTCT TAAAGTCATA AACTCTCAGA GATGTTTTTT 2520

CTAAGAAATC CTTGAAAATC GGGTCTTGCA ACAGCTCAAG CTTGTCTGTG GCATAGAGCT 2580 

TATCCCCACA AGACCAGACA AATCCAACCA AATTATCCGT ATGGTAATTC TCACCAAAAA 2640 

GCTCAAAGTG GAAGATAGAC TCTTCACTCA GCATATCTTG ACTGATTTGG TCAACAATAG 2700

TAAAATCCAA ACTCTCAGAC ACATCAGCTG ACGACACATT TAAAGCCTGC TTTAGCTGTT 2760

TGAAGCCCAT CTCATCGTAG AATTTCCCAA GATTTTCAAC ATCTGGACCA CTATAGACCA 2820

AGTCCTCTAA ACCAATCGCA ATCGGTGCCT TGGTATCAAT GGTCGCTAGT GTTTTAGACA 2880 



WO 98/18931 



PCT/US97/19588 



604 

AAAAGGCCTG TTCCTTGTCA TTGATGAGAT TTTCCTTCAT CTTAGAAGTC TTCATTCCAT 2940

CAATATTTTC ATAAATCCCC TCAAGCGAAC CATGCTCCAG CAAGAGCTTA ATACCCGTCT 3000

TTTCACCGAC TTTGGTCACC CCAGGGATAT TATCCGACTT ATCACCCATG AGCGCCTTGA 3060

GATCGATAAA CTGAGCTGGT GTGAGGCCCA TTTCTTCCAT GAGGTAATCT GGCGTAAAGG 3120

CCTCAAACTC AGCCACACCT TTCTTGGAAA TTTCAACCAC CGTATGCTCA TCCGTCAGCT 3180 

GAATCAAATC CTTGTCCCCA CTGACAATAG TAATATCAAA ACCATCCTGC TCTGCTAGCT 3240 

TATCCAGCGT CCCAATGATG TCATCCGCCT CATACTGAGC CAGATCATAG TGACGAATCC 3300 

CCATATGATC CAGCAACTCA CGAATGAAAG GAAATTGCTC ACGAAACTCA TCAGGAGTCT 3360

TGGCCCGACC ACCCTTATAG TCCGCATACA TCTCTGTCCG GAAGGTCGTC TTTCCCGCAT 3420

CAAAAGCCAC CAAAATATGA CTCGGCTCAA CCCGCTCCAA TAAATGACTC AACATCAACT 3480

GAAAACCATA AATCGCATTG GTATGCAAAC CAGCCACATT CTTAAAACGG TCCAACTGCT 3540

GATACAGCGC AAAAAACGCC CGAAAAGCTA CAGAAGACCC ATCAATCAAT AATAATTTTT 3600

TCTTATCCAT ACACCCATTA TAAAGGAAAG AATCAAAAAA TACCATTGGG AAGAGCTAGA 3660

GCAAGTATTT TTCAAACTTT TTCCGAATAA ATAGATAGAG CCAGAGAATT TAGTAAACCT 3720

AGATTTAAAA ATGTGCTATA ATATAGTATA TTGAATCTAT AATAGTACAC CTTGACTGCT 3780

AAAATATTTC TATAAATTAA TTTGACTTTC CTGATAGAGT TATTCACATC TTATTTCAAC 3840 

TCACTATAGA AGGAGGAATA GGAGGATTCT CAGACATCCG GGCATCAGCC CAACTAATGA 3900 

TTTGATTGCT AAGAAAATAT TCAGCAATCC AGAAATCACT TGTCAATTTA TTCGCGATAT 3960 

GCTGGACTTG CCAGCAAAAA ATGTGACCAT TTTGGAGGGA AGCGATATTC ACGTATTACT 4020 

CTCCATGCCT TACTCGGTGC AGGATTTTTA TACCAGTATA GACGTCTTGG CGGAGTTGGA 4080 

TAACGGTACT CAAGTAATTA TTGAGATTCA AGTCCATCAT CAGAATTTTT TCATCAATCA 4140

CTTGTGGGCT TACCTGTGCA GTCAGGTTAA TCAAAATCTT GAAAAAATTC GTCAGCGAGA 4200

AGGTGATACT CACTAGAGCT ACAAACACAT CGCTCCTGTT TACGCCATTG CTATCGTGGA 4260

TAGTAATTAT TTCTCAGATG ACCTGGCTTT TCATAGCTTT AGTATGCGCG AAGACACAAC 4320

AGGTGAGGTA TTGGCGATTA CCAACAATGG ACAGGAAAAC CATCTGGTTA AGATGGCATT 4380 

CTTGGAATTA AAAAATACAG AGAAACCAGC AAAGACAAGG TTCGCAAGCC ATGGTTGGAG 4440

TTTTTCGGCA ACAAGCCCTT TACCCAGCAA CCGCAACGAG CCATTACCCA AGCAAATCAA 4500 

CTGCTGGACT ACAAGAGCTG GTCCGAGGAG GACAGGAAAA TGTTTAGTCA ACTACATATG 4560

CGAGAAGAAC AAGTCTTGTT AGCACAGGAC TATGCCTTGG AAACTGCTAG GGCTGAAGGC 4620 

CTTGAACAAG GACTAGAGCG TGGGAAAGTT GAAGGAAGGG CAGAAAGGAA ACTTTTTGCC 4680 

TTCCTAGACA TAGTACGCCA AGGTCTTCTG ACTTCTGAGG TTGCCAGCCA GCAATTAGGT 4740

ATGTCAGTAT CTGAATTTGA GGCACTGTTG TAAAATGGCT CCATAATATC CATAGTGGGT 4800 

AAATCCCCTA TGGATATTAT GGAGCCTATT TTGTGTAGAA AAAAAGTCCC ATATGACCTA 4860

TAATGAAAAG CGACAAAACA ACTCATTAGA AAGAATCATA TGGAACAATT ACATTTTATC 4920

ACAAAATTAC TAGACATTAA AGACCCTAAT GTCCAGATTT TAAACATCAT CAATAAGGAT 4980

ACACACAAGG AAATCATCGC CAAACTGGAC TACGACGCCC CATCTTGCCC TGAGTGCGGA 5040 

AACCAATTGA AGAAATATGA CTTTCAAAAA CCTTCTAAAA TTCCTTATCT TGAAACGACT 5100 

GGTATGCCTA CAAGAATTCT CCTTAGAAAG CGTCGATTCA AGTGCTATCA CTGTTCAAAA 5160

ATGATGGTCG CTGAAACTTC TGATGACGTA CAGTCATATT TCTTCTCTTT TTATTATATC 5220

ACAGTTTTAA ATCTAGCTTT ACTAGATTCA CCGCTACTAT CTATTTATTC GGAAAAAAGA 5280

CGAAAAAACC TGAGAATCAT CTCAGGCTTG GTCATTAAAT TTTTTTCTCA ATATCGAAAA 5340 

GTGGAGAAAG TGGTCGTTTT TCATGAATAC GTACGATAGC ATCCCCTAGG AGATGAGCGA 5400

TTGAAATCTG CTCAATCTTA TCAATCAAAC GCTCTTCTGG CAGATAGATG GTATCCAAAA 5460

CAACCAATTT CTTAATAGCT GATTTTTGGA TATTGTCCGT AGCAGGACCA GAAAGAACTG 5520

GGTGCGTACA GCTTGCATAG ACTTCAACAG CACCAGCTTC CGCAAGAGCA TCTGCCGCAT 5580

GACAAATCGT TCCAGCGGTA TCAATCATAT CATCAATCAA GATACAAGTC TTGCCTTCAA 5640 

CCTTACCGAT GATATTCATA ACTTCACTAG TATTCATCTT ATCAACGCTA CGACGTTTAT 5700 

CAATAATAGC GATAGATGTT TTCAAAAATT CTGCCAACTT ACGAGCACGA GTCACCCCTC 5760

CATGGTCCGG GCTGACAACC ACATAGTCAG AACCAACCAT ACCACGACGC TCAAAATAAT 5820

CTGCAATCAG AGGAGCACCC ATCAAATGAT CCACAGGAAT ATCAAAGAAT CCTTGAATTT 5880 

GCGCAGCATG CAAGTCGATG GTCAATAAAC GATCCACTCC AGCTACTTCA AGCATATTTG 5940

CGACAAGTTT TGAAGTGATT GGCTCACGCG CTCTCGCCTT TCTATCCTGA CGTGCATACC 6000 

CATAGTAAGG CATGACAACA TTGACAGATT CTGCACTCGC ACGCTTCAAA GCATCTACCA 6060

TAATCAAAAT TTCAAGCAGA TTGTCATTTA CAGGCGAACT AGTTGATTGT AAGATAAAGA 6120 

CGTGTTTCCC ACGGATTGAT TCTTCAATGT TGACCTGAAT CTCTCCATCT GAAAATTGGC 6180

GAACACTTGA TTTCCCCAAC TCTATCCCAA TCTCCTGCGC CACACGTTCT GCCAATTCTT 6240 

TATTAGAAGA AAGGGCAAAC AGCTTTAAAT CAGAAAAAGA CATGATTTCC TCCGGTATAT 6300 

ATGTATAACT TGTGCTTTTC ACAAGATTTT CCATCTACCA TTGTAGCGCT TTTTGCACTA 6360

TTTTTCAATC AAAAATAAAA GAAGGGCACC ATATTTGTAC CCTTGCATCA TTCTTTTGAA 6420

AAATATTCTA GGTCATCAAC TCATTGTGTT TCTCAACAAA GCAATAAGCA TGATAAAAAC 6480

CATAGAGAGC AATAGCCGTA ACCACTGGAA TCGCTAAAGG CAACTCTGTT TCCAACTCCA 6540 

CAAAAGGAGA GTTAAACAAG AAGTGAGTTC CCAAGGCTAA ACCTAGAAAA ATAAGGCCCT 6600 

GTTTCTTGCC AACCTTCTGT CCTTTATAGG CTCTGTAAAG CAAGTAAACA CCTACTACAG 6660 

CTAGACCTGA AAAAGTCCAG TGAGAGGCAA TTCCTGAGAT GATACGCTCT AAAATTCGCG 6720

AAATAGTAAA GTCAAAGCCC TCTGGCAAAT CCGTACGAAT ATAACCAATA TCCTTAATCA 6780

TTTGGAATCC CAAACCGGAA GCAATTCCAA GTAAAAACAA AGATTTTAAT TTTCGCACAG 6840 

GAATCAAAGC CAAAACAAAA ACAAGTGACA ATAATTTCAA GGGTTCTTCT ACCAAAGGAG 6900 

CCGCAATAGC ACTTTCAAAG GCATTTAAAA ATGGACTATC TGGGAAAAGA ACCCCCAGTA 6960 

AATCATGGAT ATAAGTATTA GCAAAACTAG ACAACCAGCC TGAAAGGAAC ATCCCTCCCA 7020

ATAAAGACAG AATCAAAACC TTCTTTGGCA ATTCCCATTT TTCCCAATAC GGAAGAGAAA 7080

ATAAAGAGCC GGAATCATGT AAAAGAGAGC TAGAAAGATA GAAACTCCCA TTAGTCCATA 7140 

TTCCGCACCT GACCTCGAAC CGTCCGTATA GTAGATGGTT TCATACTGTA AACCAATACA 7200 

TAGCAATAAA ATAAAAATAA ATAAAATATT GCTTTTCTTC ATACACTTTC TTTCTAAATG 7260

AAGTATTTAT AATTCTACGA CTGTCATACT TCCTGTATCA ACATTGTAAA TGGCACCAGA 7320 

GATAATGACA TCGTCTGGTA TTAGGGGAGA CTCGATAAGC AGTTGCATAT CCTCGCGTAC 7380

ACTCTCTTCT ATATCTTGGA AGGGCAAGAA GTCCTGGTCT GACACATCGA CACCCAATTC 7440 

TTCCTTCAAA TACTCCTGAA AAGGTTCATT TTCAAAGGTC TGAGCACCAC AGTCTGTATG 7500

ATGCAATACC ACAATTTCTC TTGTCCCCAT TTGTTGCTGG GAAATAACTA GAGAACGAAT 7560

CATATCCTCA GTCACTCGAC CACCTGCATT CCGCAAAATA TGAGCATCCC CAAGTGCCAA 7620

ACCTAGAGCT TGCGCAACGT GCAAACGTGA GTCCATACAG GTCACAATGG CTACTCTGGT 7680

TTTAGGTTTA AGTGGCAGAT TTAACTGCCC ATGTAGGGCA ACATAAGCCT GATTGGCTTG 7740 

CATAAACTGT TCAAAATACG ACACGATTCC CTCCTTGAAA ATTTGATAGT CAAATATTTC 7800

TCCTATCTTA TCATTTTTAA GAGAATTTGT CACGGATTAT GCAAAGACCT TTTTCAAGAC 7860

TTCCTGAATC GTTGTCACGC CAATGACCTG AATTTCCTTA GGCAGAGTGA TTCCTGTCAA 7920

GGAATTCTTA GGTACATAAA TCTTAGTAAA GCCCAGTTTA GCAGCTTCGT TGATGCGTTG 7980

CTCAATACGA TTCACGCGCC GAATCTCTCC TGTCAAGCCC AGTTCTCCGA CAAAACATTC 8040 

CTGAGGATTA GTTGGCTTGT CTTTGTAGCT CGAAGCAATA GCAACTGCAA CAGCCAAGTC 8100 

AATCGCAGGT TCATCCAATT TAACACCACC AGCAGATTTG AGATAGGCAT CCTGATTTTG 8160 

CAAGAGAAGC CCTGCCCGTT TTTCCAAAAC AGCCATAATC AAGCTAGCAC GGTTAAAATC 8220

AAGTCCTGTC GTAGTACGCT TGGCATTTCC AAACATGGTC GGTGTTACCA AAGCCTGAAC 8280

CTCCGCCAAA ATCGGACGCG TCCCTTCCAT GGTTACAACG ATGGAGGAAC CAGTCGCCCC 8340

ATCCAAACGC TCTTCTAGGA AAACTTGACT CGGATTGAGT ACCTCAACCA AGCCGCCCGA 8400 

CTGCATCTCA AAAATCCCAA TCTCATTAGT GGAACCAAAA CGATTTTTGA CCGCTCTCAA 8460 

AATACGAAAG GTGTGGTGAC GCTCCCCTTC AAAGTAAAGC ACCGTATCCA CCATATGCTC 8520

CAACATACGA GGCCCAGCCA AGGTTCCTTC TTTGGTCACA TGACCTACGA TAAAGATGGC 8580 

AATGTTATTG GTCTTGGCCA ACTGCATGAG TTCAGCGGTC ACTTCACGCA CCTGAGAAAC 8640 

AGACCCCTGC ACCCCTGAAA TCTCAGGAGA CATGATGGTC TGGATGGAAT CAATAATGAG 8700 

AAAGTCTGGC TGGATACGCT CCACTTCTGC ACGAACACTC TGCATATTGG TCTCTGCATA 8760 

GAGATAAAAC TCACTATCAA TATCACCTAA GCGCTCTGCA CGTAGTTTAA TCTGCTGGGC 8820 

AGACTCCTCC CCACTGACAT AGAGAACTGT CCCCACTTGG GACAACTGGG TTGAGACTTG 8880 

TAGGAGAAGA GTTGATTTCC CAATCCCAGG ATCCCCACCG ATAAGGACGA GACTTCCTGG 8940

TACCACTCCG CCTCCAAGCA CACGGTTGAA TTCCTCCATC TCCGTCTTGG TTCGATTGAC 9000

ATTGATGGAA GTCACCTCAG CTAGTTTCAT GGGCTTGGTT TTCTCACCTG TCAAGGACAC 9060

ACGCGCATTC TTAACTTCGG CAACCTCAAC CTCTTCCACA AAAGAAGACC AAGACCCACA 9120 

GTTGGGGCAA CGTCCCAGAT ATTTAGGGGA ATTATACCCA CAATTTTGAC ATACAAATGT 9180 

CGCTTTTTTC TTTGCGATGA CAAACCTCTT TCTATATCTC TAACTCACAC TCAATCACTT 9240 

GGCAAAAATC AATCTTCTCA TTTGGCACAA ACTGGCGCAT GAGCATTCGA TGAGCAACAA 9300 

CTACCACAGT CTGATGTTCT CGATACTTAG ACATACATTC TAGAAACCGA GACTTCATTT 9360 

CCGTAGCTGT CTCATATTGA ATAGGACTAT TAGGAAGCAA CTCCCCCTTG TTTTCTAAAA 9420 

ACAGTCTTCT AGCTGTTTCA AAGTTTTCTA TTCCTGTTTT ATAGACCTGC CATTCATGTA 9480 

ATAAAGGCTC TACTCTTAAA GGAAGACCCG TAGCACAGAC CACATACGAA GCCGTTTCTA 9540

AAGCTCTTGT GACTGCAGAA GATACGATTA TTTCAGCTGA CGAGAGTAAA GGATTTTTGC 9600

TCAATTTCTG GACTTGCTGC CGTCCCATCT CAGACAAGGG TGCCAAATCT ATCCCAAATC 9660

CTATATAAGA ACGCTCCTCT AACTCACGGT AATCTGGCTC CCCATGACGT ACAAAGATAA 9720

TCTTCATTCT AGTGCCCTGT CGATCCAAAT CCACCAGTTC GAACGCCATC AGCTGCATCT 9780

CCATCTGCAA TTAAGAAAGT AGCAAAAACA GCCTGGACAA TACGCTCCCC AACTTCAAGA 9840

ACAACCTCTT GGTCTGTGAT ATTCTTCATC TGCGCAAAAA TATGCCCTTC ATTTCCAGGA 9900

TTTCCATAAT AATCCCCATC AATGACTCCA ACTGAGTTAA TTAAAACCAA GCCCTTCTTA 9960 

CGAGGATTTG AAGAACGATC ATAGAGGTAG AGAACCTCAG TCGGCTGCAT ATAAGCCTTA 10020

ACCCCTGTCG GAACCAAGAC AATCTCTCCT GGCGCAACAA CTGTACGCAC AGCAACCTTT 10080 

AAGTCGTAAC CAGTCGCATG CGCTGTCTCA CGCTTGGGCA ATAAATTTTC ATCTGTAAAA 10140 

CTCGAAACCA ATTCAAAACC ACGAATTTTC ATAATTTTCT CTTTTCTATT ATCATTTATT 10200 

CTAGATTATT CTATACTTAT TTA 10223 
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16535 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

TGGTTCTGTC CTTATCGGCG CCTTGTCTTG CTTGCCATGG CTACACCAAC TATCTCATCC 60 

GACGAAAGTA CACCAACCAC TAACGAACCC AACAACAGAA ATACAACCAC CCTTGCCCAA 120

CCTCTTACTG ATACAGCAGC TGGCTCTGGT AAGAACGAAA GTGATATTTC TTCACCTGGA 180 

AATGCAAACG CTTCCCTAGA GAAAACAGAA GAAAAACCTG CTGCAAGCCC AGCCGATCCA 240

QCACCACAAA CTGGACAAGA TCGTTCAAGT GAGCCAACTA CTTCTACTAG TCCAGTAACA 300

ACTGAAACTA AGGCAGAAGA GCCCATCGAA GATAACTACT TCCGTATCCA TGTCAAAAAA 360

CTTCCTGAAG AAAACAAGGA TGCTCAAGGA CTATGGACTT GGGACGATGT TGAAAAACCA 420 

TCTGAAAACT GGCCAAACGG AGCTTTGTCC TTCAAGGATG CCAAGAAAGA TGACTACGGC 480

TATTACCTAG ATGTCAAATT AAAGGGAGAA CAAGCCAAGA AAATTAGCTT CCTCATCAAC 540

AATACAGCTG GAAAAAATCT AACCGGCGAT AAATCTGTAG AAAAACTAGT TCCAAAAATG 600 

AACGAAGCTT GGTTAGACCA AGATTACAAG GTTTTCTCTT ACGAGCCACA GCCTGCAGGA 660 

ACTGTTCGCG TCAACTACTA CCGCACAGAT GGCAACTATG ACAAGAAATC TCTCTGGTAC 720

TGGGGAGATG TGAAAAATCC AAGTAGCGCT CAATGGCCTG ACGGAACAGA CTTTACGGCT 780

ACAGGCAAAT ATGGCCGCTA TATCGACATT CCTCTTAATG AAGCCGCAAG AGAATTTGGA 840

TTTTTATTAC TAGATGAGAG CAAACAAGGA GACGACGTGA AAATCCGTAA AGAAAATTAT 900 

AAGTTCACAG ATTTGAAAAA TCATAGCCAA ATTTTCCTAA AAGACGATGA TGAATCGATT 960

TACACAAATC CATACTATGT CCATGATATC CGTATGACAG GAGCCCAACA CGTAGGCACT 1020

TCTAGCATTG AAAGTAGCTT TTCAACACTT GTCGGTGCTA AAAAAGAAGA TATCCTCAAA 1080 

CACTCCAACA TCACTAATCA CCTAGGAAAC AAGGTAACTA TTACCGATGT TGCAATCGAT 1140

GAAGCTGGTA AGAAAGTGAC CTACAGCGGA GATTTCTCTG ACACAAAACA TCCTTATACT 1200

GTTAGCTACA ATTCCGACCA ATTCACTACC AAAACAAGCT GGCGCCTGAA AGATGAGACA 1260

TACAGCTATG ATGGCAAACT GGGAGCTGAC CTAAAAGAAG AAGGAAAACA AGTTGATTTG 1320

ACCCTTTGGT CACCAAGTGC TGATAAGGTT TCTGTTGTTG TCTACGACAA GAATGACCCT 1380 

GACAAAGTAG TTGGAACTGT CGCTCTTGAA AAAGGGGAAA GAGGAACTTG GAAACAAACT 1440

CTAGACAGCA CAAACAAACT CGGAATCACA GATTTCACTG GCTACTATTA TCAATACCAA 1500 

ATCGAGCGTC AAGGTAAAAC TGTTCTTGCA CTCGATCCTT ACGCTAAATC TCTTGCTGCT 1560

TGGAATAGCG ACGATTCCAA GATTGACGAT GCCCATAAAG TGGCTAAAGC CGCCTTTGTA 1620

GATCCAGCTA AACTCGGACC TCAAGACTTG ACTTATGGTA AGATTCACAA TTTCAAGACT 1680 

CGTGAAGACG CCGTTATCTA CGAAGCTCAT GTGCGTGATT TCACTTCAGA TCCTGCCATT 1740 

GCAAAAGACT TGACCAAACC ATTTGGGACT TTTGAAGCCT TCATTGAAAA ACTAGACTAT 1800 

CTCAAAGACT TGGGTGTAAC CCATATCCAG CTCCTTCCAG TCTTGTCTTA CTACTTTGTC 1860 

AATGAATTGA AAAACCATGA ACGCTTGTCT GACTACGCTT CAAGCAACAG CAACTACAAC 1920

TGGGGATATG ACCCTCAAAA CTACTTCTCC TTGACTGGTA TGTACTCAAG CGATCCTAAG 1980 

AATCCAGAAA AACGAATCGC AGAATTTAAA AACCTCATCA ACGAAATCCA CAAACGTGGT 2040

ATGGGAGCTA TCCTAGATGT CGTTTATAAC CACACAGCCA AAGTCGATCT CTTTGAAGAT 2100 

TTGGAACCAA ACTACTACCA CTTTATGGAT GCCGATGGCA CACCTCGAAC TAGCTTTGGT 2160 

GGTGGACGCT TGGGGACAAC CCACCATATG ACCAAACGGC TCCTAATTGA CTCTATCAAA 2220

TACCTAGTTG ATACCTACAA AGTGGATGGC TTCCGTTTCG ATATGATGGG AGACCATGAC 2280 

GCCGCTTCTA TCGAAGAAGC TTACAAGGCT GCACGCGCCC TCAATCCAAA CCTCATCATG 2340 

CTTGGTGAAG GTTGGAGAAC CTATGCCGGT GATGAAAACA TGCCTACTAA AGCTGCTGAC 2400

CAAGATTGGA TGAAACATAC CGATACTGTC GCTGTCTTTT CAGATGACAT CCGTAACAAC 2460

CTCAAATCTG GTTATCCAAA CGAAGGTCAA CCTGCCTTTA TCACAGGTGG CAAGCGTGAT 2520

GTCAACACCA TCTTTAAAAA TCTCATTGCT CAACCAACTA ACTTTGAAGC TGACAGCCCT 2580 

GGAGATGTCA TCCAATACAT CGCAGCCCAT GATAACTTGA CCCTCTTTGA CATCATTGCC 2640

CAGTCTATCA AAAAAGACCC AAGCAAGGCT GAGAACTATG CTGAAATCCA CCGTCGTTTA 2700 

CGACTTGGAA ATCTCATGGT CTTGACAGCT CAAGGAACTC CATTTATCCA CTCCGGTCAG 2760

GAATATGGAC GTACTAAACA ATTCCGTGAC CCAGCCTACA AGACTCCAGT AGCAGAGGAT 2820

AAGGTTCCAA ACAAATCTCA CTTGTTGCGT GATAAGGACG GCAACCCATT TGACTATCCT 2880 

TACTTCATCC ATGACTCTTA CGATTCTAGT GATGCAGTCA ACAAGTTTGA CTGGACTAAG 2940

GCTACAGATG GTAAAGCTTA TCCTGAAAAT GTCAAGAGCC GTGACTATAT GAAAGGTTTG 3000 

ATTGCCCTTC GTCAATCTAC AGATGCCTTC CGACTTAAGA GTCTTCAAGA TATCAAAGAC 3060

CGTGTCCACC TCATCACTGT CCCAGGCCAA AATGGTGTGG AAAAAGAGGA TGTAGTGATT 3120 

GGCTACCAAA TCACTGCTCC AAACGGCGAT ATCTACGCAG TCTTTGTCAA TGCGGATGAA 3180 

AAAGCTCGCG AATTTAATTT GGGAACTGCC TTTGCACATC TAAGAAATGC GGAAGTTTTG 3240

GCAGATGAAA ACCAAGCAGG ACCAGTCGGA ATTGCCAACC CGAAAGGACT TGAATGGACT 3300

GAAAAAGGCT TGAAATTGAA TGCCCTTACA GCTACTGTTC TTCGAGTCTC TCAAAATGGA 3360

ACTAGCCATG AGTCAACTGC AGAAGAGAAA CCAGACTCAA CCCCTTCCAA GCCTGAACAT 3420

CAAAATGAAG CTTCTCACCC TGCACATCAA GACCCAGCTC CAGAAGCTAG ACCTGATTCT 3480

ACTAAACCAG ATGCCAAAGT AGCTGATGCG GAAAATAAAC CTAGCCAAGC TACAGCTGAT 3540

TCACAAGCTG AACAACCAGC ACAAGAAGCA CAAGCATCAT CTGTAAAAGA AGCGGTTCGA 3600

AACGAATCGG TAGAAAACTC TAGCAAGGAA AATATACCTG CAACCCCAGA TAAACAAGCT 3660

GAACTTCCAA ATACAGGAAT CAAAAACGAA AACAAACTCC TATTTGCAGG AATCAGCCTC 3720

CTTGCGCTCC TTGGTCTCGG TTTCTTACTA AAAAATAAAA AAGAGAACTA AACTAGCCCT 3780

CCTATAGAAA AATCCCCCAA GCATTATAGC TCGGGGGATT AATTTTTGTA CAATATTTGT 3840

TGTCCTAATA AACTTGATTA GGATTTTTTA TTAAGCCTCT TTCATAGCAA AATAAGCTCG 3900 

TACTTTGGGT GCAACTTGTG TTCCGAAGAG TTCAATAGCT CTCAGAACCT GGTCATGAGG 3960 

CATAGAACCA AGCGGTAGAT GAAGCATGAA GCGGTCCAAT CCTAAATCCT CTATCATGCG 4020

AATCAATTTT TCGGCCACCT GATCTGGATT GCCAACAAAC ATGGCGCCAT TTGGCCCTAC 4080 

CTGCTCCAAA TATTGCTCAT AACGCAATTC CTGCCAGTGC GGACGGTCTT TGGAAATAGC 4140

ATCCACCACT TGCTTAGTCG GATGGAAATA ATCTTTCACC GCCTGCTCAC CATCTTCCGC 4200

AATCCACCCC CAAGAATGGG CTCCCACTTT CAAGTCTTTG TCAGCATGGC CCCTTCGCTT 4260

CCAATCTCAC GATAAGCCTG AATCAACTTT TTAAAATAAC GTGGATTACC ACCAATAATA 4320

GCATATACAA TCGGTAGACC AGCCTGAGCA ATCTTCACTG TTGATTCGAC ATGACCACCT 4380 

GTAGCTATCC ACAAGGGCAA TTTGTCCTGA ACTGGACGAG GATAAACTTC TTTACCAGCA 4440

ATCGTTTGAG TCAATCGACC TTGCCAGTCT AACTTGGTCT TTTCATTGAC TAACTGAAGC 4500 

AAGTCTAATT TCTCATCAAA AAGAGAGTCG TAGTCTTTCA AGTCATAACC AAACAGAGGG 4560

AAAGATTCCG TGAAAGAGCC CCTTCCAGCC ATAATCTCCG ATCGTCCATT TGACAAAGCA 4620

TCGATAGTGG CATACTGTTG GAACAAACGA ATCGGGTCCA TGCTTGACAG AATGCTGACT 4680 

GCACTGGTCA AACGGATTTT CTTGGTATTG ACTGCCCCAG CGGCCAGAAC AATCTCTGGG 4740

GCTGATACTG CAAAATCCGC CCGATGGTGC TCACCAATCC CATATACATC CAAACCAACC 4800 

TTGTCAGCCA GCTCAATCTC TGCCACCAAC TGGCGAATGC GTTCAGCATG ACTGTAAGTT 4860

TGTCCAGTCC CTTCAAGCTC CGTTATTTCC CCAAATGTTG AAATTCCCAA TTCTACCATT 4920

GTGATTCTCC TTATCTATCT CTGTACTTCA ATTTGAAAAA TTATTCTAAC ACGAATCTTG 4980

AGTACAAGCA ACCGATTTGC TCATTAGAAA AAGCCTAGAT AACTAGACTT TTTTAGCTTA 5040

TTCTACCGTT ACTGACTTGG CAAGGTTACG TGGTTTGTCC ACATCGAGGC CACGGTGGAG 5100 

GGTTGCAAAG TAAGCGACTA ATTGCGTTGG TACGACCATT GAAATTGGTG AGAGGTATGG 5160

ATGTACGGTC GTAAGGACGA TATCGTCGGT ATCTTTGGCT ACATTCTCTT CTGCGATAGT 5220

GAGGACTTTG GCACCACGGG CTGCGACCTC TTGGATATTT CCACGAGTAT GATTGGCAAG 5280

AACTGGATCT GACAAGAGAG CCAAAACAGG CGTTCCTTCT TCAATCAAGG CAATGGTTCC 5340 

GTGCTTGAGT TCTCCTGCAG CAAAGCCTTC ACACTGGATA TAAGAAATCT CTTTGAGTTT 5400 

GAGACTTGCT TCCATGGCTA CGTAGTAATC TTGACCACGT CCGATGTAAA AGGCGTTACG 5460

AGTTGTTTCA AGAAGTTCAC GAACCTTGAC TTCAATGGTT TCTTTCTCTG AAAGAGTTGA 5520

TTCGATAGAC TGAGCTACGA TTGACAATTC ATGAACCAGG TCAAAGGCTT GCGCTTTAGC 5580 

ATTACCATTT GCTTCTCCGA CTGCTTTTGC AAGGAAGGCA AGGGCTGCGA TTTGCGCTGT 5640 

ATAGGCTTTA GTTGATGCCA CGGCAATTTC AGGACCTGCG TGAAGGAGCA TGGTATAGTT 5700 

GGCTTCACGT GAGAGGGTTG AACCTGGAAC GTTTGTCACT GTTAAGCTTG GAATTCCCAT 5760 

TTCATTAGCC TTGACCAAAA CTTGACGACT ATCCGCTGTT TCACCAGATT GGCTGATAAA 5820

GATGAAGAGT GGTTTCTTGC TGAGAAGTGG CATACCGTAG CCCCACTCAG ATGAGATTCC 5880

AAGTTCAACT GGTGTATCTG TCAATTCTTC CAACATTTTC TTAGAAGCAA ATCCTGCATG 5940 

GTAAGATGTT CCAGCTGCAA GGATGTAGAT GCGGTCTGCG TCTTGAACAG CCTTAATGAT 6000 

ATCTGGGTCT ACGACAACTT GACCAGCCTC ATCTGTGTAG GCTTGGATGA GTTTCCGCAT 6060 

AACAGTTGGT TGCTCGTCAA TTTCCTTGAG CATGTAGTAA GGGTAAGTTC CCTTACCGAT 6120 

ATCTGACAAG TCAAGTTCAG CAGTGTAGCT AGCACGCTCA CGACGATTTC CATCATAGTC 6180 

TTGAACTTCC ACACTATCAG CCTTGACGAT TACCAACTCT TGGTCATGGA TTTCCATGTA 6240 

TTGGTTAGTT TCACGAATCA TAGCCATGGC GTCTGAGCAG ACCATGTTAT AGCCTTCTCC 6300 

AAGACCAATC AAAAGTGGTG ATTTATTTTT AGCTACGTAG ATGACTTCAG GATCTTGTGA 6360

GTCAACCAAG GCAAAGGCAT AAGAACCACG GATGATGTGA AGGGCTTTTT TGAAGGCTTC 6420 

AAGAACTGAG AGCCCTTCTT CTTCCGCAAA TTTTCCAATC AAATGAACGG CTATTTCAGT 6480

ATCTGTCTGC CCCTTGAAGT GGTGACCTGC AAGGTATTCT TCCTTGATTT CAAGATAGTT 6540 

CTCAATCACC CCATTATGCA CCAAGACAAA ACGTTCCGTC TCAGAGCGGT GTGGGTGAGC 6600

ATTGTCCTCA GTTGGTTTTC CGTGAGTAGC CCAACGAGTA TGTCCGATAC CAGTTGTTCC 6660

CTCAACACCA GCTGTCTTGG CAGACAATTC TGCAATACGA CCAACCGCCT TCACCAAATG 6720 

GTTATCAGCA CCATCTAGGA CAAAAATTCC CGCAGAATCA TAGCCACGGT ATTCAAGCTT 6780

TTCAAGCCCT TGAATCAAAA TATCAGTTGC ATTTGTGTTT CCAACAACAC CAACAATTCC 6840 

ACACATAGTA TATACGACAC AGGCAAGCTG TGCTTTCTCC TTAAAATTGG TATAGTCTAA 6900 

TTCATCTTTT ATAGAATCAG CAAAAACAGT ATATACTTGT TTCTTTCACT TGTCAAGAGT 6960

AAAAATTGGT ATAGTTCAAA TTAAGCTCCT GTAAGCATAA AAACTCTGAC CGATTGGGAT 7020

AATCAGTCAG AGTCCTTTTT AAAATCCATT ATTATCGCTT AATTCTTTGA ACCAGTGGCC 7080

TGATTTCTTC AGACGACGTT CTTGCGTTTC CAAGTCTAAT TCGACCAAAC CATAGCGATT 7140 

TTTATAGCTG TTGAGCCATG ACCAGCAGTC AATAAAGGTC CAAATCAAGT AGCCCTTACA 7200

GTTGGCACCA TCTTCAATGG CACGGTGAAG TTCACGAAGA TGACCTTTTA CAAAGTCAAT 7260

ACGGTAATCA TCTTGAATCA TTCCATCTTG ACGGAATTTT TCTTCCCCTT CAACACCCAT 7320

ACCATTCTCA GTCAACATCC ACTCAATATT GCCATAATTT TCCTTGATAT TTTGGGCGAT 7380

GTCATAAATC CCTTGCTCAT AAATCTCCCA ACCACGGTGA GAATTGATTT TACGTCCAGG 7440 

CATCACATAA GGCTCGTAAA AATGTTCTGG TAAGAGTGGA CTCTCTGGAT GCTTAGCAAA 7500 

TCGAGGAGCC ATAACACGCA AAGGTTGATA GTAGTTCACA CCAAGGAAGT CCACCGTATT 7560

ATCACGAATG AGTTCCAACT CTTCCTCTGT AGCATCAGGT AAAAGACCGT GTTCATGCAA 7620 

GATTTCTACC AACTCCTGTG GATAAGTCCC CAAGACAGAT GGATCTAAGA AAGATTGGGC 7680

CTGAAAAAGG GCCGCAATAC GAGCTGCCTT GACATCAGCA GGATGCTGGC TACGTGGATA 7740

AGCCGGTGTC AAGTTGAGGA CAATCCCAAT CTTGGAATCA GGCAAAAGTT CATGGCAAGC 7800 

CTTAACAGCC CGGCTGCTGG CCAATTGTGT ATGATAGGCT ACCTTAACAG CTGCCTCTGC 7860 

ATCCACCTTA TGTGGATAAT GGGCATCATA AAAATAACCA AATTCTACAG GAACGATGGG 7920

CTCGTTAAAG GTAATCCATT GATCCACTAA ATCTCCATAA GTCTCAAAAC AAAAACGAGC 7980 

ATAGTCTTCA TAGGCTGAGA CTGTCGCCTT ATTTTCCCAA CCATCACCAT CCTCTTGAAG 8040

GGCAAAAGGT AAATCAAAAT GATAGAGATT GACTAACAGA CGAATTCCTT TAGCCTTAAT 8100

AGCCTCAAAG ACCTTACGAT AAAAATCCAC ACCTTGAGTG TTGACTTTTC CACAGCCTTG 8160 

TGGAAAAATC CGTGACCACT GAATAGAAGT CCGAAAGGCT GTGTGACCAG TCTCTAACAA 8220



AAGCTCAATA TCCCGCTCCC AATTTTCATA AAAAGTCGAT GTCTTATCTG AACCAATCCC 8280

ATTATAGTAA CGATTTGGCT CCACTTGGAA CCAGTAATCC CAGAGATTGT CTCCCTTACC 8340 

GTCACCAGCT ACACGTCCTT CTGTCTGCGG TCCAGAAGTA GAGGATCCCC AGACAAAATC 8400

CTTTGGAAAT CTTAGCATAC ATTTACCTCT TTATCTACTC ATTTCTCCCA TTATACAGAA 8460

AAAACAAGGT AAAAACTAGT TACATTTTTT CCTTGTTTTT CTTCTGATTA TAGTTTTTAT 8520 

TTCTTGCTTA GGATTTCAAG CGTTTCAAGC ACGTTATCTG CATGAACCTC AATGGTGTCA 8580 

CCAGTTGCCT TGATCTTAAC TTCTACAATG CCATCGGCCG CTTTTTTACC AACAGTGATA 8640 

CGGATTGGAA GACCAATCAA GTCACTATCG CTAAATTTAA CACCGACACG TTCGTTACGG 8700 

TCATCTGTCA AGACTTCATA ACCAGCTCCC ATCAAGCTTG CTTCAAGTTT TTCTGTCAAG 8760

GCTTGCGCTT CTTCATCCTT GACATTGACA GTAATCAAAT GCACATCAAA TGGTGCCAAT 8820

TCTTTAGGGA AATTGATTCC CCAAGCGTAA CGGTATTCAC CTTTTGGCGT TTTGTTAACA 8880

AAGAGGCGAG CGTGTTGCTC CATCACTGCT GAAAGAAGAC GGCTGACACC GATACCGTAA 8940

CATCCCATGA TGATTGGCAC AGCACGACCA TTTTCATCCA AGACATCTGC TCCCATGCTT 9000 

GCTGAATAGC GAGTTCCGAG TTTGAAAATA TGACCGATCT CAATACCACG CGCAAAGTTA 9060

AGGACACCTT GTCCATCTGG GGAAATTTCA CCCTCACGAA CTTCACGGAT ATCCACATAT 9120 

TCTGCAGTAA AATCACGGCC TGGGTTCACA CCAGTCAAGT GGTAGTCATC TTCGTTAGCA 9180

CCGACAACTG CATTGCGAAC ATCTTGTACC TTACGATCTG CAATAATTTT AATATTCTCT 9240

GGCAAACCAA CTGGTCCAAG TGAACCAAAT CCTGCTTGAA CAACATTCGC CACTTCTTCT 9300 

TCGCTAGCAA CGTCAAAGAA ATCTGCTCCC AAGTGATTTT TCAACTTGAC TTCGTTGAGT 9360 

TGGTCATTTC CAACTAGAAG GGCTGCAACA AGCTCACCAT CTGCAATGTA GAAGAGGGTT 9420

TTAATCGTTT GTTCTTCTGG AACATTGAGG AAGGCTGCAA CTTCATCAAT TGATTTAACA 9480

TCTGGCGTTG CAACACGAGT AACTTCTTCT TCAGCGACAA CACGGTTGCT TGGTTTGTAC 9540

TCGTTTGTTG CCATTTCTAA GTTAGCTGCA TAGCTAGACT CACTTGAGTA AGCAATGGTA 9600 

TCTTCACCAG AGACTATCCA TTTGAGCAAT TCTGCCTTGA TTTCTTCTTG CACTTCTGCA 9660 

GGAATTTCGT CAAATGAGGC AACTGACTTG TCCAAGACAA CCCAGCGGTC AAGGTCTGTA 9720

CGAGCAGATG TAATGGCCAT AAATTCTTGG CTATCCTTAC CACCCATGGC TCCACCGTCA 9780 

CCAATAATAG CCTTGAAGTC TAAACCACTA CGAGTGAAAA TACGCTCATA GGCTGCTTTG 9840 

TACTCATCAT AAACACTATC CAAACTATCA TAGTTAGCGT GGAAACTATA AGCATCCTTC 9900

ATGATAAACT CACGTGTACG AAGAAGTCCA TTACGCGGGC GTTTTTCATC ACGATACTTG 9960

GGCTGAATTT GATAAAGGTT GAGTGGCAAT TGCTTGTAAG ATTTAACAGA ATCACGGACA 10020

ATAGCTGTAA AGGTTTCTTC GTGAGTTGGA CCTAAGATAA AGTCTGATTT TTCACGGTTT 10080

TTTAGTTTGT AAAGGTCTTC ACCATAGGTT TCGTAACGAC CTGATTCACG CCACAATTCT 10140 

GCACTAAGAA GGGCTGGAGC CAACATCTCA ACAGCACCAA TCTTTTCGAA TTCTTGGCGC 10200

ATGATGTTTT TAGCTTTTTC AATCACACGG TTGGCAAGTG GTAGATAAGA ATAAACACCT 10260

GCTGAAACTT GGCGAACATA ACCAGCACGC AACATAAGAG CATGGCTGAT AACTTGAGCA 10320

TCGCTTGGCA TTTCGCGAAG CGTTGGGATA GGCATTTTAC TTTGTTTCAT AATATTCCTC 10380

GATTATCTAA AAAAGAGTCG CATAATGTCA TTCCAAGTCA CAGCAATCAT CAAGACAACC 10440 

ATGATGACCA CTCCGGCCAA GGTGACATAG GTTTCAATTT CTTGTTTCAA TGGTTTGCGG 10500 

CGGATGGCTT CTAGGATATT GAGCACAATC TTACCACCAT CCAAGGCTGG AATCGGAATA 10560 

AGATTAAAAA TCCCAATATT GATGGAAATC ATTGCCAAGA AGTACAAGAT ATTTTCAATT 10620

CCATTTTTAG CAGCATCACT ACTTGCCTTA AAGATAGCAA CAGGTCCACC CAACTTGTTC 10680 

AAATCTGGTT GGAAAATCAG ATTTTTCAGA GCTGAGAGAA TTCGGAGAGC TGAGTCAGCA 10740 

GCAGTTGTAA AACCACCTAC AAACATGGAT AGAAAATCTG ACTTAACCCC CGGTTGAACA 10800

CCTAGAAGGT AACGACCTTG ACTATCTTTG GGTGTAACAG TGACTTGTTT GTCACTCCCC 10860 

TTTTCAGAAA TAGTCACATC CAAAGTCGGT GCCGTCTTAT CTTTGGTTTC TGTTTCCACA 10920

GCTTGGATCA AGCTTTCCCA GTTGCTAACC TCATGTGAGC CAATCTTGGT AATTTGTGCC 10980 

ATTTCTGGTA CTCCTACCTT GGCCAAGGCA CCTTGGGGCA TGATATGGAA CTGATTGGTA 11040 

TCAACATCTC TGACACCACC CTGCATAAAG ATTAAAACCC AAAAAACAAC GACACCTAAG 11100 

ATAAAATTGT TCATAGGACC TGCAAAATTG GTAATCAGTT TGCCCCAGAT AGTCGCATTT 11160 

TGATATTGAA CATCTAAAGG TGCAATCCGA ACCTCAGTAC CATCTGCTTC CACAACCGTT 11220

GCATCGTGAT CCACTGCAAA TGTTTTTTCT TCTTCCAGAA CCAATCCTTT GATAAAGAGC 11280

TTGTCTTCAA AATCAAACTG GGTCACCTGC ATAGGGAGGG CTGTTTGATC CAATTTTTTA 11340 

CCTGAGAGAT TGATGCGTTT AACCTTACCA TCATCAGCAA GTGTCAAACT AACAGGCGTT 11400

CCTGTCTTGA TTTCAGTTGT ATCATCACCC CAACCGGCCA TGCGGACATA GCCACCCAGA 11460

GGCAAGATTC GAATGGTATA GGCCGTTCCA TCCTTGCCAA TGTGAGCAAA AATTTTAGGT 11520

CCCATACCGA TGGCAAATTC ACGTACTAAA ATCCCTGATT TCTTGGCAAA GTAGAAGTGA 11580 

CCGAACTCGT GCACCACTAC AATAATCCCG AAAACCAGAA TAAAGGTTAA AATTCCGAGC 11640 

ATAGCGTTTC CTCCGTCTTT TGATTAAAAG AGTCCAAATA AGTGCATGAT TGGAAATACA 11700 

AGCAACATAC TATCGAAACG ATCCAAAACA CCACCATGTC CAGGGATAAA TTTCCCAGAA 11760

TCCTTAACAC CAAAATGACG TTTGATCGAA CTTTCTAGTA AATCACCAAA TTGTCCAGCA 11820

ATGCTAAAGA AAATAGCAAA GACTGACATC TTGTAAATTC CATATGGAAG AGCAACTGTA 11880

CTGTCAACTA TCATAAGGAT AATGGTTACT AAAATTGCTC CTAAAATACC ACCCAAGGCA 11940

CCCTCAAGGG TTTTATTAGG CGATACCCTT GGTGCTAACT TTCGTTTCCC ATAGTTCATC 12000

CCAACAAGAT AGGCACCACT GTCTGTCGCC CAGACGATAC ACAAGGCTAA GAGAGCCTTG 12060 

TCCAAACCTG CAACACGAGC ATCTAGTAAA GCATTAAATC CAAAGCCCAC GTAGAAGCTC 12120

ATAGCAAGAG GGAAAACCGC ATCCTCAATC GTATAAGACT TGCTAAAAAC GGTCGTTCCT 12180 

AACATGATTG AAATCAAAAC ACTATAGGCA ACCACATTCC CATCAACTGG CAAAAAAGTC 12240

AGGTAATTCT CCAAGGGAAT GGTCAATGCA AAGGTTGCAA AGAGGGTCAA GAGGCCCTCC 12300 

ATCGTCATGG TCTCTAGACC TCTCATCTTC AAAAGTTCAT GCATGGCTAG CATGGCTATG 12360

ATTCCGATTG CTATCTGAAG CAAGAGGCCC CCAATCATTA AAATTGGTAG GAAAATAGCC 12420 

AGGGCAATCC CTGCAAACAA GGTTCTTTTC TGTAAATCCT GGGTCATATT TCCTCCTAAA 12480

CTCCTCCAAA TCGGCGATGA CGACGATTAT AGGCAAGAAT AGCTTCCTGC AAGGCCGCTT 12540

CGTCAAAATC AGGCCATAAG GTGTCCGTAA AATAAAGCTC ACTATAGGCT CCCTGCCATG 12600

GAAGGAAATT GCTGAAACGT AATTCTCCAC TAGTACGGAT AATCAAGTCT GGGTCTCGTA 12660 

AGTCCTTAGG CAAATGCTGA GTAAAGAGAT AGTTACCAAT CAATTCCTCT GTGATGTCAC 12720

CTGGGTTGAT TTTGGCATCT AAAACATCCT GGGAAATCAA CTTAAGCGCC TGTGTAATCT 12780 

CAGCACGTCC ACCATAGTTA AGAGCAAAAT TAAGAATCAA TCCTGTGTTG TTCTTAGTCA 12840

ATTCCTCAGC CTTGGTTAAA GCTTCAAAGG TTTGCTTAGG CAGGCGGTCT GTCTCCCCAA 12900

TCATTTGAAT CTTAACATTA TTCGCATGTA GTTCCGGGAC ATAATTATCA TAAAACTCTA 12960

CTGGCAAGTT CATGATAAAC TTGACTTCCT GATCTGGACG GGTCCAGTTT TCCGTAGAAA 13020

AAGCATAGAC CGTAATAACC TTGACGCCCA GTTTCTTGGC TGCCTTGGTC ACGGTTTGCA 13080 

ATGCTTCCAT GCCCGCCTTA TGTCCAAAAA CTCGCGGTTG CATACGTTTT TTAGCCCAAC 13140

GGCCATTGCC ATCCATGATG ATGCCGATAT GAGCAGGAAC CTGTGTCGGA ACCTCTACTT 13200 

CCACAGCCTT ATCTTTCTTA AAAAATCCAA ACATGATCTT ATTCCTATTC AAAAATCTAT 13260 

CGTTTCATTA TACCATATTT CCCCATTTTC TTCTATCACT AAGCTATTTA TTCTCAGGCA 13320 

CCAAGCCCAT TTTTCAAAAA AATAAGCCGC CTGATTGGGC GACTTTATTT TTATAGGGAG 13380 

ATTATTATGA AAAAGTTTTA GGAGTTTAAG TTAAGGTCTT CTTAACTTAT GAACTTAGTG 13440 

TACACTCCCT AGCTTAAAGT TTCCTTAAGT ATTTTTAAAA ATCAAATTTT TCCATTTCTC 13500

CTGCCAATTT TTCTTGGATA AACGTGTTTG ATAGAGTTCC ATTCGGTCTT CATTTTCTAA 13560

GAAATGAGGA GTTGGACGAA CTTGAAAATT CAAAATATCC TCCAAACCAT AAGGTACATA 13620 

GAGTTCAAAA TCTAATTCTT CATTCAAGCG CAGTCCAACT GCCGTACACC GTTCTGGATA 13680

CTTACTCATA GCATCACGAG AACTGGTATA GGAAGCAGTG TGAGGACTGT GCTGATGCAT 13740 

ATAGACCTGA TTTTTCAATT CCCACTGGTA CTGAGGAAAA TCCTCTCTCA GCTTTTTCTC 13800 

CAGTAATAAG GTTTCCTCAT AAGAAAAATC TGGATCAAAG AAAATCACAT CTATATCTGT 13860

TTCATGATCA AAAGGGGATT TGTCTGACAA AAGATTCCAG ATGAAATTTC TGACAGAACC 13920 

TGCTGCCAAC CACGAGTCTT TCAAACCAAG GTCTCGGATG ATCGTCAGAA TGGCCATCAT 13980

ATCTGGACTT TCTCTAAAAG CCTCTAAGAT TTCTTGCTTA TTTTTCACTG TATTCATAAC 14040 

CTAAGTGCTC ATATGCCTTA GCAGTCGCCA CCCGTCCAGA CCGTGTCCGC ATGATAAAAC 14100 

CTTTTTGAAT CAAGTAAGGC TCATACATGT CTTCAACTGT CTCACGCTCT TCGGCGATAT 14160 

TCACAGAAAG AGTTCCTAGA CCAACAGGTC CTCCACTGTA CATCTCAATC ATGGTGCGAA 14220

GGATTTTTTG ATCCACATAG TCCAAACCTT CATGGTCAAC ATCCAGCATA GTCAAAGCCT 14280

TATCGGTAAT AACATCATCG ATAACCCCAT TCCCCATTAT CTGGGCAAAA TCGCGCACGC 14340 

GCTTGAGGAG ACGATTGGCA ATACGAGGGG TTCCACGACT ACGTAGGGCC AACTCAGATG 14400 

CTGCCTCATG GGTGATTTCC ATCTCAAAAA TATCTGCCGT CCGCTCGACA ATTTCTGTCA 14460

AGTCAGCATG AGCATAATAC TCCATATGAC CTGTAATCCC AAAACGTGCC CGTAGTGGAT 14520 

TTGAGAGCAT ACCAGCCCGA GTCGTCGCAC CAATCAAGGT AAAAGGAGGC AACTCCAAAT 14580

GAACACTGCG ACTGCCTTCA CCAGCCCCAA TCATAATATC GATGTAGAAG TCCTCCATGG 14640

CACTATAAAG CACTTCTTCC ACTGACATGG GTAAGCGATG AATCTCGTCA ATAAAGAGGA 14700

CATCTCCAGG CTCTAAATCA TTCAAAATCG CTACCAAATC ACCCGCTTTT TCGATAACAG 14760

GACCAGACGT TTGCTTGAGA TTGACTCCCA GTTCATTGGC AATGACAAAA GCCATGGTTG 14820

TTTTCCCAAG CCCTGGAGGG CCAAATAAGA GCACATGATC CAGCGCTTCA TCCCGCATTT 14880

TAGCGGCTTC GATAAAGATC TGAAGTTGAT CCTTAACCTT ATCCTGACCA ATATATTCAC 14940

GTAAATACTG AGGACGGAGC GTGCGTTCTA CTAACTCCTC ATCACCCATC ATCTCATTAT 15000 

CTAAAATTCT ACTCATGGCT CTATTATATC AAAAAAAACA AGCCACAAAC AAAAAAGCCA 15060

CCTGATTGGG TGACTCCTAA GTTTAGCACT TATGTGGTAT AATATTATAC GGCACTTCTA 15120 

CACCGCCTAC GAAAGGAGGT GAGATAGCCC ATGATGGAAT TAGTACTCAA AACTATTATC 15180 

GGACCAATTG TGGTCGGTGT CGTTCTTCGT ATAGTCGATA AATGGCTAAA CAAGGACAAA 15240 

TAGTGTCAAA AAAGACCTCA AGCTTATTTG GTCGTGAGCT TGGGGTCTTT TCTAGCCTAT 15300

GATATAGAAC TAGTACTCAA TTCCTTTTTA TTATCCCATA GTTCACGAAT TTTGTCAAAA 15360

CTTTACATTT TCTTCAACCG CTGTACGACA AGACGGTTAA GATTAAGAGA ACGTTAGGGA 15420 

TTCTATCAAT TTCATAGAAA TTTTGATTTC GTAAACGAAG AGACAATCTT ACATGTCACT 15480 

TCTCATTTAA TACGCCACTA CTAGACAAGC AAAATCATTA TTACAGTAGT TCCAGTCCTT 15540 

CAATTAACAG TCACTTACAA TCAAATTGAG TTTGAACTAG CTGAAGCGAC CACAGACCTA 15600 

TTTCTTAGTC ATATTCGCTA AAAAAATCCC CGCCAAAATC TCAAAAAGTC CCCGCCAATT 15660 

CCCCGACCAA AATCCGAAAA ATACCGAAAA ATATCGAAAA ATTATTTTTA GAATAGTCCC 15720 

AAAAATCCTG AAATAGAGCT AAAAAACTCC ACCTGATTCG GTGGAGTTAA GGGAGATTAT 15780 

TATGAAAAAG AAAAGTTTAG GATTTTATTA AATAAAGTTA GGAGGTCTTT ATTTAATAAC 15840 

TACATGATAC AAGACGAAAC TTAAAACTAG CTTAACTTTT CTAAAATTTT ACTATTTTGC 15900 

AAAAAATTTC TATCACCAGC ACCTCACCAA TCGAGTAGGG GATAATCTCT AGCCCCTCTC 15960 

ACACCACCGT ACGTGCCGTT TGGCATACGG CGGTTCAACT AACTTTTAAC GCATGTCGTT 16020 

CAAGGTAATA ATCCAAACAC GAAACCAGTC CACGTTTTTC CAGGACTGGT TTTGATATAG 16080 

CACGTTTAAG TACCGACTTC TGAGCTACTA ATTGATAATG GTCGCCCCAG CCAGATACCT 16140 

TATCTGCTAT CCATTTAGGA ACTCCTAACT TAAGCAATCC CCATAATCGT CTCGATTTCT 16200 

TCTTCCATTG CTTCCAGATA ATCACTCGTA GGCGAGTACG CAAGCGCTCA TCTATGCTGG 16260 

CGACTATACT TTTCATATTT CCCAATGAGC AATAGTTTAT CCATCCTCGA ATAGACAAAT 16320 

TCAGTTGCTC AATACGTCTT GTTAGGTCTA TACTCCATTT CCTCTGTGTT AGTTTCTTCA 16380 

ATTTAAACTT AAATCTCCGA ACACTATCTT GATGTGGACG GCTTTTCCAA CCATCTGATA 16440 

ATTTCCAGAA CCCAAAACCT AGATATTTCA ACTCTCTTGG TCATGTTTAC TTTCAAACCT 16500 

AGCCGTTTCT CAATAAACGA CTGACTGAAT ACATC 16535

(2) INFORMATION FOR SEQ ID NO: 75:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8136 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

CCAGAGCGTT GCGTCCGAAA GTCTATCCAG ACACGGCTCT TTAAAAACAA AAGGAGAAAT 60 

GATGCATACT TATTTGCAAA AGAAAATTGA AAATATCAAA ACAACCCTAG GTGAAATGTC 120

AGGTGGTTAC CGTCGTATGG TTGCGGCTAT GGCTGATTTA GGATTTTCAG GAACTATGAA 180

GGCTATCTGG GATGACCTCT TTGCCCATCG TAGTTTTGCC CAGTGGATTT ATTTGCTGGT 240 

TTTAGGAAGT TTTCCTCTCT GGCTGGAGTT GGTTTACGAA CATCGTATTG TTGACTGGAT 300

TGGGATGATT TGTAGCTTGA CAGGGATTAT CTGTGTAATC TTTGTATCGG AAGGTCGAGC 360

AAGTAATTAT CTTTTTGGCT TGATTAACTC TGTTATTTAC CTTATTTTGG CCCTACAGAA 420

AGGCTTTTAT GGTGAGGTGC TGACGACACT TTACTTCACA GTCATGCAGC CAATTGGACT 480

TCTAGTTTGG ATTTATCAGG CACAGTTTAA GAAGGAAAAG CAGGAGTTTG TCGCGCGTAA 540

ACTGGACGGC AAGGGCTGGA CAAAGTATCT TTCCATTAGT GTGCTTTGGT GGTTGGCCTT 600

TGGCTTCATT TATCAGTCTA TTGGTGCCAA TCGTCCCTAT CGTGATTCAA TCACAGATGC 660

AACCAATGGG GTAGGGCAAA TCCTCATGAC AGCTGTTTAC CGTGAACAGT GGATATTCTG 720

GGCGGCTACC AATGTCTTTT CAATCTATCT CTGGTGGGGA GAAAGCCTGC AAATTCAAGG 780

GAAATATCTA ATTTATCTCA TTAACAGTCT AGTTGGTTGG TATCAATGGA GCAAGGCAGC 840

TAAGCAGAAT ACTGATTTAC TTAACTAGGA AAAGATGTTT GAAAGTGCTG TTTTGAGATT 900

TCGATTAAAA CAGATATAGT TGATAATCAA GGATTTATAG TATGAAAAAG AGGATCGGCG 960 

GGTCCTCTTT TGTTGTTGAA AAGATAAAAA ACTCAGTAAC CTAGAAATAA GACAACTGAA 1020

GCTTTACTCT ATATTCAATT TTTAGGAATG AGAAGGTCTA GATAAAATTG GACAACTTCC 1080 

TGGTCTGTGA AATCTTGACC TTTTTTGAGC CACCAGGTCA ATGTCTCGAT AAAGTTGGAC 1140 

ATGACCAAGT GTTGGAGGTA AGAAGTAGGC AGATTAGGGT GGGCTTCTTT TAAATTATCA 1200 

GCTAGCACGG AATAGACATG GTGTTCTAGC TCTTTATGGA GTTGACGGAG GAAGTAGTCA 1260 

TTTTTGGAAA ATAGCAGACT GGTGATATGG TCTTGGTTTT TATGAAAATG GAGAAAGAGG 1320

TGGGCGAGGT AGTCCTCGGT TGAAATGGCT TGCTCTCTTT CAAAAAGATG ATGGAAGAGG 1380

TAGCGGCAGA GCTGGTCCAG AAGAAGCTCC TTACTCTCAT AGTGACAGTA AAAGGTGGAT 1440 

CGTCCCACAT CTGCGAGATC AATGATATCC TGAACAGTAG TGGCCTCGTA GCCCTTAGCA 1500 

TTCAAAAGTT GTATAAAAGC TTGATAGATG GCTTTTTTGG TTTTGCTGAT ACGGCGGTCA 1560 

ATGTTAGTCA TATGGACACT TAAGGCAAAT TGTTCAGAAC TGAATAAAGC TGACGTTTTG 1620

CTTCTATCCT TTCTTTGAGT TTTAGTGGAT AATGATAATG AACAAGGTGT TCATAAATCT 1680 

ATTATAACAA AGGAATGAGA AATATGAAGG CAAAATATGC TGTTTGGGTG GCTTTTTTCT 1740 

TAAATTTGAC TTATGCCATT GTTGAGTTTA TTGCAGGTGG AGTATTTGGT TCTAGCGCTG 1800 

TTCTTGCTGA CTCTGTGCAT GACTTGGGAG ATGCGATTGC AATTGGAATA TCAGCTTTTC 1860 

TAGAAACAAT CTCCAATCGT GAAGAAGACA ATCAGTACAC CTTGGGCTAT AAGCGGTTTA 1920

GCCTGCTAGG AGCCTTGGTA ACAGCTGTGA TTCTCGTAAC GGGCTCTGTT CTAGTCATTT 1980

TGGAAAATGT CACGAAGATT TTGCATCCGC AACCAGTCAA TGATGAGGGG ATTCTCTGGT 2040 

TAGGAATTAT TGCGATTACT ATCAATCTGT TAGCGAGTCT GGTGGTTGGT AAGGGAAAGA 2100 

CAAAGAATGA GTCTATTCTG AGTCTGCATT TTCTGGAAGA TACGCTAGGG TGGGTAGCTG 2160 

TTATCCTGAT GGCGATTGTT CTTCGATTTA CGGACTGGTA TATCCTAGAT CCTCTTTTGT 2220 

CCCTTGTCAT TTCTTTCTTT ATTCTTTCAA AAGCCCTTCC ACGTTTTTGG TCTACACTCA 2280

AGATTTTCTT GGATGCTGTG CCAGAAGGTC TTGATATCAA GCAAGTAAAG AGTGGCCTGG 2340

AGCGATTGGA CAATGTGGCC AGCCTTAATC AGCTTAATCT CTGGACTATG GATGCTTTGG 2400 

AAAAAAATGC CATTGTCCAT GTTTGTCTAA AAGAAATGGA ACATATGGAA ACTTGTAAAG 2460

AGTCTATTCG AATTTTCCTA AAAGATTGTG GTTTTCAAAA TATTACCATT GAAATTGATG 2520

CTGACCTAGA AACTCACCAA ACCCATAAGC GAAAGGTGTG TGACTTGGAA CGGAGTTATG 2580 

AGCATCAACA TTAGAAAAAA GTGAAAAATA CTTGGGTACT ATCTTATTTG GAATAGAGTA 2640

ATTTCTTTAT TATTTAAATA TTTCAAAAAT TGGTAAGAGA AGAGCATTGT ATAAACTCCA 2700

GATATATGAT TGTTAATGAT AAAAATTTTT CGATTAGATA CAAAATGCTT GACTTGGAGT 2760 

CAACTCAAAG TTATATAATA AGATAAGTGA GTTAGAATAG CGTGAATTCA GTGAATGAAA 2820

TGAGAGGAGG TTAGCGTGTG AATATTAAAT CTGCCAGTGA TTTGTTGGGA ATTTCAGCGG 2880 

ATACGATTCG GTATTATGAA CGGGTTGGTC TTGTGCCACC GATTACTCGT ACTGCTACTG 2940

GGATTCGTGA TTTTCAAGAT CAGGATATCG AAGCGCTGGA ATTTATTAAG TGTTTTCGTT 3000

CGGCGGGTGT CTCTGTAGAT AGTTTAGTTG ACTATATGTC GCTCTACCAA AAGGGAGATG 3060

AAACGAGAGA GGAGAGGCTT GGTATTTTAG AAGAGGAAAA GCAAAAATTA GAGGAGCGCT 3120 

TGTCTCAGCT ACAGACAGCT TTAAATCGTT TAAATCTCAA AATTAAACTT TATAAGGAAG 3180 

GAAAATTTTA AATGAAATCA GCAGTATATA CAAAGGCAGG TCAGGTTGGA CTTGCTAGCA 3240

TTGAACGTCC GCAAATAATA GAAGCGGATG ATGTGATTAT TCGTGTGGTT CGTGCGTGCG 3300

TTTGTGGTTC AGATTTATGG AGGTACCGTA ATCCAGAAAC GAAAGCTGGA CACAAAAATA 3360

GTGGACACGA AGCGATTGGG ATTGTTGAAG AAGCTGGGGA AGCCATTACG ACGGTGAAAG 3420 

CAGGTGATTT TGTGATTGTC CCTTTTACAC ATGGATGTGG TGAGTGTGAT GCCTGTCTTG 3480

CTGGATTTGA CGGTTCTTGC GACAATCATA TTGGCAATAA TTTGGGGGGT GATTTTCAGG 3540 

CAGAATATAT TCGCTTCCAC TATGCAAACT GGGCGCTGGT TAAAATCCCT GGTCAACCTT 3600

CTGACTATAC AGAAGGGATG CTCAAGTCCC TTTTGACTCT TGCAGATGTC ATGCCGACAG 3660

GCTATCATGC GGCGCGTGTT GCAAATGTTC AAAAAGGGGA CAAGGTTGTT GTTATCGGTG 3720

ATGGGGCTGT TGGTCAATGT GCTGTCATCG CGGCTAAGAT GCGTGGAGCA TCACAAATTA 3780

TCCTTATGAG CCGTCATGAA GACCGTCAAA AGATGGCTAT GGAGTCAGGT GCGACAgcTG 3840 

TTGTTGCAGA ACGTGGTCAA GAAGGAATTA CCAAGGTGCG TGAAATCCTC GGTGGAGGAG 3900 

CAGATGCAGC ACTTGAATGT GTTGGTACGG AGGCTGCTAT AGAACAGGCG CTAGGTGTTC 3960 

TTCATAATGG AGGGCGTATG GGCTTTGTAG GAGTCCCACA CTATAATAAT CGTGCTCTTG 4020

GTTCGACATT TATGCAAAAT ATCTCTGTAG CAGGTGGGGC AGCTTCTGCT ACAACATACG 4080 

ATAAGCAATT TTTACTAAAA GCCGTCCTTG ATGGTGATAT CAATCCAGGT CGCGTCTTTA 4140 

CTTCAAGTTA TAAACTGGAA GATATCGACC AAGCCTATAA AGATATGGAT GAACGTAAGA 4200

CAATTAAGTC TATGATTGTA ATCGAATAAA AAACGAATAG GAGTTTTAGA ACTCTATTCG 4260 

TTTTTTATGT TATCCTATTC TTGATTTAGG GTACTTTCTC TTAATGTCAG TCTGGTTCCC 4320

AGCATGGTCA GGCTAGGGAT TTTCCGACCG TGGAGGACTT CCTTGTTAAG AATATCCATA 4380 

CCTGCTCGGC CCATTTCTTC AGTATAAACT GTAATACTAG AGAGGGGAGG ATAGACCTGT 4440

TTGGTCAGAC TAGTGTCGTT AAAGGAAATG AGGCTGACGC GATCTGGCAG GCTGATTCCA 4500 

GCTTCTTGGA GGGCACGGAG GGCACCGATA GCTAAACTAT CGCTGGCTGC GAAAAATGCT 4560 

GGCGGAAGTT GGTCTCCCAA GCTCTGAATG GCCTCCTTCA TTAAGTCATA GCCAGACTGG 4620

GCAGTAAATC TTCCTTGAAA GACCAGTTCA TCATGATAGA TTCCCCTCGC TTGACTATAG 4680 

TTTTTGAAGT TTTCTAGACG CTTGTCCTGA ATGATTTCTT CTTGGTCTGT TGTTTCTTCA 4740 

AGGCCTGTTA GAATCCCGAT ACGGTCCATT CCTTGACTGA GGAAATAATC GACAACCTGT 4800 

TTCATAGCAG TGTAAAAATC CGTGATAATA CAGGTATGTC CCAGGGAAAG TGTATCGCTG 4860

TCTAGAAATA CAAGAGGCTT TTGGTATTCT TCAAAGGCAG AAATCTGAGC TCGACTAAAC 4920

TTTCCGATGC AGAGAATCCC AATCACTTCC TCGCTTAGGG TAAAAGGGTG GTCATTAAAA 4980

TAGCGCAAGA TATCATAGTC CAACTCTTGG GCTCTTTTTT CTATTCCTAG GCGAATCTGG 5040 

TAGTAGTAGA GGTCGTCCAG CTCCCCTTGT TCGCTGACCC ATTGGATAAT GGCAATCTTT 5100 

TGCTTGGGTT TGTGGGACTC GCCTGTCTTG AGGTGCTTGG TGTAGCCCAG CTCTTCAGCA 5160 

ACGGTTAAAA TACGGTGTCT GGTTTCTTCT GTAACAGATA GGCTCTGGTC GCGGTTGAGG 5220 

ACGCGGGATA CGGTCGCGAT AGAGACAGAG GCTAGCTGTG CAATGTCTTT TAAGGTAGCC 5280 

ATAAATCCTC CTTGATTAGG TTAGTATATC ATGTTTTTCT TCTTTTTACT GATATTTTAC 5340

TAAAATTTTA GTAAAAAGGA TTGACCTTGG AAAATTCCTT GGATATAATA GAAAGAAAAC 5400 

GATTACACGT TAAGATGGCT TAACGGACAG TCAAAGGAGA ATTCATATGG CACAACATCT 5460

TACTACTGAA GCCCTTCGCA AAGACTTTCT TGCTGTTTTT GGTCAAGAAG CAGATCAAAC 5520

CTTCTTTTCA CCAGGCCGCA TTAATTTGAT TGGTGAACAC ACAGACTACA ACGGTGGGCA 5580 

CGTTTTTCCT GCTGCTATTT CCTTGGGAAC TTACGGTGCA GCTCGTAAGC GTGACGACCA 5640

AGTCTTGCGT TTCTACTCAG CTAACTTTGA GGACAAGGGC ATTATCGAAG TGCCTCTCGC 5700 

TGACCTCAAG TTTGAAAAAG AGCACAACTG GACCAATTAT CCAAAAGGTG TCCTTCATTT 5760 

CTTGCAAGAA GCTGGGCACG TGATTGACAA AGGTTTTGAT TTTTATGTTT ATGGAAATAT 5820

TCCAAATGGT GCTGGCTTGT CTTCTTCTGC ATCCTTGGAA CTCTTGACAG GAGTCGTGGC 5880 

TGAGCATCTC TTTGATTTAA AATTAGAGCG TCTCGATTTG GTTAAAATCG GCAAACAAAC 5940 

AGAAAACAAC TTTATCGGAG TAAACTCTGG CATTATGGAC CAGTTTGCTA TTGGTATGGG 6000

GGCAGACCAA CGTGCTATTT ACCTAGATAC TAATACTTTA GAATACGACT TGGTGCCACT 6060 

TGATTTGAAG GACAATGTCG TTGTTATCAT GAACACCAAC AAACGCCGTG AATTGGCGGA 6120 

CTCTAAATAC AATGAACGTC GTGCTGAGTG TGAAAAAGCA GTGGAAGAAT TGCAAGTTTC 6180 

CTTGGATATT CAGACTCTGG GTGAATTGGA CGAGTGGGCC GTTGACCAAT ATAGCTATCT 6240 

GATTAAAGAT GAAAATCGTT TGAAACGTGC TCGCCATGCT GTGCTTGAAA ACCAACGTAC 6300 

CCTCAAAGCT CAAGTAGCAC TCCAAGCAGG AGATTTGGAA ACATTTGGAC GCTTGATGAA 6360

TGCGTCACAC GTTTCTCTGG AGCATGATTA TGAAGTAACT GGTTTGGAAT TGGATACCCT 6420

TGTTCACACA GCTTGGGCAC AAGAAGGAGT TCTCGGTGCT CGTATGACAG GGGCTGGTTT 6480 

TGGTGGCTGT GCcATTGCCT TGGTTCAAAA AGATACTGTT GAGGCCTTTA AGGAAGCTGT 6540 

AGGCAAACAC TACGAGGAAG TAGTTGGATA CGCTCCAAGC TTCTATATCG CTGAAGTTGC 6600 

AGGTGGCACT CGCGTCCTTG ACTAGTCAAA AGGAGGCTCT ATAGTGACCT TAGTAAATAA 6660 

ATTTGTAACA CATGTCATTT CTGAAAGCTC ATTTGAGGAA ATGGATCGAA TCTATCTGAC 6720 

CAATCGTGTT TTGGCACGAG TGGGAGAAGG TGTTTTGGAA GTTGAGACCA ATCTGGATAA 6780

ATTGATTGAC CTCAAGGACC AGCTGGTTGA AGAAGCCGTT CGATTAGAGA CGATTGAGGA 6840 

TAGTCAGACT GCGCGTGAAA TCCTTGGTGC TGAACTGATG GATTTGGTGA CTCCTTGTCC 6900 

AAGTCAGGTC AATCGTGATT TTTGGGCAAC CTACGCCCAC TCTCCAGAAC AAGCGATAGA 6960

GGATTTTTAC CAACTCAGTC AGAAAAATGA CTACATCAAA CTCAAGGCCA TTGCTAGAAA 7020

TATCGCTTAT CGTGTTCCAT CTGACTACGG AGAACTTGAA ATTACCATCA ATCTCTCTAA 7080

GCCTGAAAAA GATCCCAAAG AGATTGTGGC AGCCAAGTTG GTGCAAGCTA GTAATTATCC 7140 

TCAGTGTCAG CTTTGTCTAG AGAATGAGGG CTACCATGGT CGAGTTAACC ACCCAGCTCG 7200

TAGCAATCAC CGTATTATCC GTTTTGAAAT GGTTGGTCAG GAATGGGGTT TCCAGTATTC 7260

GCCCTATGCT TACTTTAATG AGCATTGTAT CTTTTTAGAT GGCCAGCATC GTCCCATGGC 7320

CATTAGTCGT CAGAGTTTTG AACGTCTGTT GGCTATCGTA GACCAGTTTC CAGGATATTT 7380

TGCTGGATCT AATGCCGACC TGCCGATTGT GGGGGGCTCT ATTCTAACTC ATGATCATTA 7440

TCAGGGAGGC CGTCACGTAT TTCCTATGGA ATTGGCTCCC TTGCAAAAGG CCTTCCGATT 7500 

TGCTGGTTTT GAGCAGGTCA AGGCTGGAAT TGTCAAGTGG CCCATGTCTG TCCTACGTTT 7560

GACTTCGGAT TCCAAAGAGG ATTTGATCAA TTTGGCTGAT AAGATTTTGC AGGAATGGCG 7620

CCAGTATTCA GATCCTGCAG TGCAGATTTT GGCAGAGACA GACAGGACAC CGCATCACAC 7680 

TATCACACCC ATTGCCCGCA AACGCGATGG ACAGTTTGAG TTGGACTTGG TCTTGCGAGA 7740

CAATCAGACT TCAGCAGAGT ATCCTGATGG TATCTATCAT CCCCACAAGG ATGTCCAACA 7800

TATCAAGAAG GAAAATATCG GCTTGATTGA GGTCATGGGC TTGGCAATCT TGCCACCACG 7860

TCTGAAAGAA GAAGTGGAGC AAGTCGCTAG CTATCTTGTA GGAGAAGCTG TTACAGTTGC 7920

CGATTATCAT CAGGAGTGGG CAGACCAACT CAAATCCCAA CATCCAGACT AACGGATAAA 7980 

GAAAAAGCCC TTGCAATCGT CAAGGACTCT GTGGGTGCTA TCTTTGCGCG TGTACTTGAG 8040

GATGCAGGAG TCTACAAGCA GACAGAACAA GGGCAGACAG CCTTTATGCG CTTTGTGGAA 8100 

CAGGTCGGAA TTTTACTAGA CTAGGAGCTT TCTCGG 8136

(2) INFORMATION FOR SEQ ID NO: 76:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10011 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76:

CCCATAGTGA AGAGTGGCCA TAAGAAGGTC TTCTAGGCTT AATTTAGGTT TTCGTCCACC 60 

TTTTGCGTGT TTAAGTTGAT AAGCTGTTTT TAACACAGCT GAACATCTCT TCAAAAGTCG 120

TGCGCTGAAC ACCAACAAGA CATTTAAATC GTGTATCAGT TAGTTGTTTA CTTGCTTCAT 180 

CATTCATAGA ACTACTATAC CATGTTTTGT TTCGCAGGAA GTCTAATATT GTCAAATACT 240 

GGAACGCTCA TTGCTGGGAT ACGGAATAAG ATTGGCCCAG CTTCGATAAC TGGGATACCT 300 

GGTTCAAAAC CAAGGTCTGT TGCAGCGATT GGTGTAAAGA TATCGTAACC TTTCATAAGG 360 

TCTTCGTTTA CATCTTTCAC CATAACTGCA TCACAGTGAA CATCGTAACC ACGGTTTGAA 420 

AGTTCTTCTT CTAGAGCACT TTTAATTTGG TGACTTGAGT TAACACCTGC ACCGCAGGCA 480

GCAAGAATTT TAATCATTTG GATTTCCTCC GATTTTATTT TTTAATAGAC AAGATTAAGC 540

GGTTGCTTCA GCAATGTAAG CATAAAGGGC TTCTGGTTCA GAAATTTTTG ATAGGTCTTC 600 

AAGATGACCA TTTCCTGTGA AGAAGTCCAT TAACTGAGCA AGAATGTTCG TTTGACTTGA 660

ACTTGAATTA TTGATGATAA AGAAGAGCAA GGATACTTCT ACTTCCTTAC CTGGCGCAAT 720 

CATATTATGG AAAGTCACCG GTTTCTCTAA TCGAACAACC ACCACTTTCT CAGCTAGATT 780

ATGAACAATA TCTGTGTGAG GAATCATTAC ATTTGCAAGT CCTTTCCTAG AAATTCCATA 840

TATAAACCAG TTGGAAATGA CTTTTCACGC GTGATCAAGG CTTCACGATA AGTTGGAGTG 900 

ACAATTTCTC GTTCTTCCAA CAAGCTTGCT ACCTGATCAA AAAGTTATTC TTGATTATCC 960

GCTTCTAAGC AAAACACAAG GTTTTTGTCA AAGAAATAAT CTAATACCAT AAGGTTTTCC 1020

CTTCTTTCCA TTAACTTTAT GCTATAAGTA TAACACTATA TGAAATCGTT GTTAATTACT 1080

TTCTATTCTT TTTTGTCTCT TTTTTTATAT TTTTGTTTTG TTTATAGTTT GTTATATAAA 1140 

AATAAACACA CAAACAAATA CTCCAAGCAT TTTTCTGTTC TAATACTCAA TGAAAATCAA 1200

AGAGCAAACT AGGAAGCTAG CCGCAGTTGT TCAAAACACA GTTTTGAGGT TGTAGATGAA 1260

ACTGACGAAG TCACTCAAAA CATGGTTTTG AGGTTGTAGA TGAAACTGAC GAAGCAACAg 1320

CCATACATAC GGTAAGGCGA CGCTGACGTG GTTTGAAGAG ATTTTCGAAG AGTATAAAAA 1380

CTAAAAAAGC AGACCATCTA AGCCTGCTTT ACTATTGATT CTTATATAAA TTTCCTGTGA 1440 

ACAAGGAAAG GCATTTCTGA TAACTTATTC TTCATCCATA CTCAAGACGC TGAGGAAGGC 1500 

TTCTTGCGGA ACTTCAACTG ATCCGATGGA TTTCATGCGT TTCTTACCAG CTTTTTGTTT 1560

TTCAAGGAGT TTACGCTTAC GAGAAACGTC ACCACCATAA CATTTAGCAA GTACGTTCTT 1620

ACGAAGGGCC TTGATATCAG TACGAGCGAC AATCTTGTGT CCAATAGCCG CTTGGATTGG 1680

AACTTCAAAT TGTTGGCGAG GGATGATTTT CTTGAGTTTA TCAACGATGA GTTTCCCACG 1740 

TTCGTAGGCA AAGTCCTTGT GAACGATAAA GCTGAGGGCA TCCACCTTAT CTCCATTGAG 1800 

AAGAATATCC ATTTTCACCA GCTTAGATGG GCGATATTCT GACAATTCGT AGTCAAAGCT 1860 

TGCATAACCA CGTGTCGAAG ACTTAAGTTT ATCAAAGAAG TCAAAGACAA TTTCAGCAAG 1920 

AGGAATTTGA TAGATAACAT TGACACGGTT ATCATCAATA TAGTCCATAG TCACAAAGTC 1980 

CCCACGCTTA CGCTGAGCTA GCTCCATTAC TGCTCCGACG AACTCCTGTG GTACCATGAT 2040

TTGCGCCTTG ACATAAGGCT CTTCAATGGT CGCAATCTTA GTTGGGTCTG GAAACTCAGA 2100 

TGGGTTAGAC ACATCCATAG ACTCACCGTC GGTCAAATTA ACTTTGTAAA TAACAGACGG 2160 

AGCTGTCATG ATGAGGTCAA TATTGAACTC ACGCTCTAAA CGTTCCTGGA TAACATCCAT 2220

ATGGAGAAGT CCAAGAAATC CACAACGGAA ACCAAATCCA AGTGCCTGAG ATGTTTCTGG 2280

TTCAAACTGA AGACTAGCAT CATTCAGTTG CAATTTTTCA AGcGCTTCAC GCAGGTCATT 2340

GTACTTGTTT GATTCGATTG GGTAGAGACC CGCAAAGACC ATAGGATTCA TCTGCTTATA 2400 

ACCATGTAAT GGTTCTGCCG CAGGATTGGT TGCCAAGGTA ACGGTATCAC CCACACGAGT 2460

ATCCTGAACC GTCTTGATAG ACGCCGCAAT GTAACCAACA TCACCAGTCG CAAGGAAATC 2520 

ACGACCAACC GCTTTTGGTG TAAAAATACC GACTTCGGCC ACATCAAAGG TCTTACTATT 2580

GCTCATGAGC TGAATCTTAT CACCAGGTTT GACCACTCCG TCCATGACAC GCACTTGGAG 2640

GATAACCCCA CGGTAAGCAT CGTAAACAGA GTCGAAAATC AAGGCCTTAA GTGGCGCCGT 2700

CACATCACCC GTTGGTGCTG GTACTTTTTC TACAATTTGC TCGAGGATTT CTTCAATCCC 2760

AATACCAGCC TTGGCAGAAG CCAAAACTGC TTCACTGGCA TCCAAACCAA TCACATCTTC 2820

AATCTCTGTA CGCACGCGCT CCGGATCTGC AGCCGGCAGG TCAATTTTAT TAATGATAGG 2880

CATGATTTCC AAATCATTAT CCAAAGCCAG ATAAACGTTG GCAAGAGTTT GAGCCTCAAT 2940

TCCTTGAGCC GCATCGACCA CCAAAATAGC ACCCTCACAG GCAGCTAGCG AACGTGAAAC 3000

TTCATAGGTA AAGTCAACGT GCCCTGGTGT GTCAATCAAG TGGAAAATAT AAGTTTCCCC 3060 

ATCTTTTGCA GTGTAATTCA ACTCGATGGC ATTCAACTTA ATAGTAATTC CACGTTCCCG 3120 

CTCTAGCTCC ATGCTATCCA AAAGCTGGGC CTGCATTTCA CGACTTGAAA CCGTCTCTGT 3180 

TTTTTCCAAA ATGCGGTCTG CTAGAGTTGA TTTTCCGTGG TCAATATGGG CGATAATAGA 3240 

GAAGTTACGG ATCTTCTCCT GTCGTTTTTT CAATTCTTCT AAGTTCATGA TTCTCTTCCT 3300

TTCAGGGTAT CTATTTATTA TAAATTGTTT TTGATATTTT GACAAGACCA TACCCTGCTA 3360

GGAGTACTAA TCTTCAGCGA CAAAGCCGTC ATTTTCGATA AAGTGGTGTT CTGTCATTCC 3420 

TTGGTCTGTA AAGACAATCC CGTGAAGGAC ACCACCATAA ACAGCTCCTC CATCCATTCC 3480

AATCTTGCCA TCTTCTGTAG TCCAAAGCTC AGATGTACCG CGTTCTTGCT GTAACAAACC 3540

ATAGACCGGT GTATGACCGA AGACAATGGT TTTTCCAGTA TGATTTTCAG CTCCGTGGAA 3600

TGGTTTTCTA AGCCATACTT TTTTATAATC TGTTGTTTCA TGCCAGTCGT CCAAGGTCAA 3660

ATCAATACCT GCGTGAACAA AGATATACTT GTCTGTCTCT ACTACAAATG GCATTTGACG 3720

AATGAATTCG ACCAAGTCTG CCGCTTCAGC GgCAACCCGC TTGGCATCTT CTACTCCATC 3780

AACTGGTGCA TCCAAGGGAC GACCTAGGAT AGAGTTAATG GTTGTATCTC CACCATTGCG 3840

ACTATAATGG TCATAACTTT CTTCTGGGTC ATCTAGCCAA GTCAAAAACA TATACTCGTG 3900

GTTTCCGGAC AAACAGATAG CCCCTTGATT GTCCACCAAG TCCTTGACCA TTTCAAGAAC 3960 

ACGGTGACTA TCCTCACCTC TGTCAATCAA ATCACCTAGA AAGAGCAACT GGGGCTGACC 4020

ATCCCAGGTT TTGAGAAGGT CTTCCAGCAT CCCAGCTTTT CCGTGAACAT CTCCAATTAC 4080

ATAATAATCT GTCATCTTAT TTCTCCCTGT TTCTCAACAA TTCTCTTGCT TGCGTCAGGG 4140

CTGCTTCTGT CACATCATCA CCTGCCAACA TCTTGGCAAC TTCCTCCACT CGCTCTTCGA 4200

CCGTCAAGAG ACGAACAGTC GAAACCGTTG AATGGTCATT ACTAATCTTC TCAATAAAGA 4260

ATTGATAATC TGCAATCGCA ATTACTTGTG GCAAATGGGA GATAGCCAAA ACCTGACCAT 4320

GCTGACCAAT TTTATGAATT TTCTGAGCAA TAGCTTGAGC AACACGACCT GAAACTCCCG 4380

TATCCACCTC ATCAAAGACA ATGCTAGTCT TGCCTTCTTT ACGTGAAAAG GCAGACTTAA 4440 

TGGCTAACAT GAGACGAGAT AATTCCCCTC CAGAAGCAAC CTTAACCAAG GGTTTAAAGT 4500

CTTCTCCAGG GTTGGTTGAA ATATAAAACT CAACCATTTT ATTTCCCTCA CGACTGAATT 4560

TTCCCTTACT AAAACGAACC TGAAACTGGG CTTTTTCCAT ATAAAGATCT TGCAGTTCTT 4620

GTTTAATCTC AGCTTCGAGT TGCTGAGCCA AATTATGACG AGCAGAAGCA AGTTGACCTG 4680

CCAAATTGAC AAGATTGACT TCCAACTTCT TAAGCTCTGC TTCCATGTCC TCAGACGAAA 4740

GATTATTGCC TGTCAAGAGA TTGTATTCTT CCGTAATCTT GGCAAAATAA AGCAAAACAT 4800 

CATCAACAGT CCCACCATAC TTACGAGTAA TAGTATGAAG GAGGTCCAAA CGATTCTCAA 4860 

CCTGCATCAG GCGATTGCCA TCAAAATCAA GGTCCTCAAT GATAGCTTCC AAACGTTTGC 4920

TAATGTCTTC TAAAACATAG TAGGTCTCAG ACAGATAGCT TGAAATTTCA CGGTATTCAG 4980 

GATCATACTC TTCGACACTT TCCATGTCAT TCATAGCTGA ACGAACATTG GCCAGACTTG 5040

AAAAATCTTC ATTGTCCAAC ATACTGTAGG CATTGGTCAG TGTATCCGCA ATATTTTTGT 5100 

GGTTGAGGAG TTTATCTCGC TCTTGATTGA GAGCCAAGTC TTCTCCAGCC TGCAAGTTTG 5160 

CTGCCTCAAT CTCTGCCATT TGAAATTCCA ACATTTCGAT ACGTGCCTTG TGTTCCTGTT 5220

GGTTTTTCTT GACTTCCAGA ACCTGCTTGC GCATTTTCCG ATAGGCATCA AAACTCGTTT 5280

GATAGGTTTC TTTCAAGTCC CAAAAAGCGG CATCACCAAA TTCATCCAAC ATCTGGATAT 5340

GCAGTTGGGG ACGCATTAAC TCCTCATGGT CATGCTGACC ATGAATATCT ACAAGATGTT 5400 

GCCCAATAGC TCGCAAAACA GACAGATTAA CCATCTGACC ATTTACACGG CTGATACTAC 5460

GACCATTTTG CAAGATTTCC CGACGGATGA TAATTTCATC ACCTAATTCT AAACCTTGCT 5520

CATCAAAAAT TTCCTGTAAA AGACGACTAT TCTCAACTGA GAAAAGCCCC TCAATCTCTG 5580 

CCTTTGGTGC ACCATGACGA ATAACATCTG TCGTCGCACG AGCTCCCAAC ATCATATTCA 5640

TGGCATCAAT GATAATCGAC TTCCCTGCAC CCGTTTCACC AGTCAGGACA GTCATCCCCT 5700 

TTTCAAAATT GAGGGAAATA GCCTCAATAA TGGCAAAGTT TTTTATCGAA ATTTCAAGTA 5760

ACATATAGAC CTACCAATTT TTTACTTGTT CAAAGATTTC CTCTGCTAGA CTTCCACTTC 5820

TGGCAATGAC TAAAATCGAG CTATCATCAG TCAAACAGCT AAAAATCTTG TCTGCAAAAG 5880

TCTCGATTAA CTGAGCTTTT ACAAAAGCCG TATTTCCTGG AATAACTTGG AGATTGATCA 5940

TCTTATCCAT CAATTCAGCC GATTCGATAT TGTCTTCAGC CAGTTGCAGA CTTTTTACGA 6000 

TTGATTTTGG CAATTCGTAG ACATAGGTGT TGTCTCTCAA AGGAATTTTG ACAATACCTA 6060 

ACTCTTTGAT ATCTCGGGAT ACCGTCGCCT GAGTGGCAGT GATACCTGCT TCTTTCAAAT 6120

GTTCTACAAT TTCTTCTTGC GTGCCGATTT GATAATCTGT CACCAATCTT CTAATTTTTT 6180 

CAAGTCTCTC TTTTTTATTC ATTTTTAAAT TGACTATGCG CCCTCTCTAC TGCTTCTTTA 6240 

ATCTCAGCAA GAATCTGATT GCTTGCTGAC TTTTCTTTTT TCAAATACGC TAAAAATTCA 6300 

ATATTTCCAT GTCCACCTTG GATGGGAGAA AAGTCCAAGC CAAGGACTGA AAAACCTACC 6360 

TCTACTGCCA TAGCTGTTAC AGATTCAAGG ACATTCTGAT GAACCTTAGC ATCTCGAATA 6420

ATTCCATTTT TCCCAATCTG CTCACGTCCT GCCTCAAACT GAGGTTTGAC AAGTGCTACC 6480 

ACCTGACCTT GATCAGCCAA GACACGGTGC AAGGCTGGCA AAATCAGACT AAGGGAAATG 6540 

AAACTCACAT CAATACTGGC AAAGCTCGGC TCCTGCTCGA AATCAGTCTT TTCAGCATAG 6600

CGGAAATTGA ACTGCTCCAT GCTGACAACT CGTGGGTCTT GGCGTAATTT CCAAGCCAAC 6660

TGATTGGTAC CAACATCGAC TGCAAAGACC AACTTGGCAC TATTCTGTAG CATGACATCG 6720 

GTAAAACCTC CAGTAGAGGC CCCGATATCA ATCGTAGTCG CGCCATCCAC CGACAAATCA 6780

AAGACCTGCA AGGCCTTTTC CAGTTTCAAA CCACCACGGC TGACATACTT GAGTTTCTCC 6840 

CCCTTGAGTT TTAATTCGGT GTCATCTGGA ATTTTCTCTC CTGGCTTGTC AAACCGTTCT 6900

CCATTAAGGA CTGCTACGAC TAGGCCAGCC ATCACACCTC GCTTGGCCTG CTCTCTCGTT 6960 

TCAAACAACC CCTGTTTATA AGCTAGTACA TCCACTCTTT CCTTAGCCAT TGATTCTCAA 7020

ACTTTCTACT ACACTTACAA TCGATTCTGT TTCAAAGGGA AGCTGCTGGG CAATTTCTTC 7080

TAATTTTTCA TTAGCTTGAT CCAGGGTTTG GTTACAAAAG GCAATGGACT CTTCCAAGCC 7140

CAACAGGGCA GGATAGGTTG ATTTTTCTGC CTGCAGATCC TTTTGAGGTG TCTTGCCGAT 7200

TTCCTCAAAA CTAGCTGTCA CATCCAGTAC ATCATCTCTG ACTTGAAAAG CAAGTCCAAT 7260

CAATTCACCC ACAGTTTTCA GCTTCACCTG CATTTCAGGT GACAATTCAG CTATAATAGC 7320

TGCCGCTTGG AAGGGATAGG CTAGTAACTT CCCAGTCTTA TTGGCATGAA TAGTCTGAAG 7380 

TTCTTCCAAA GACAAGTGCT GGTGTTCGCC CTCCATATCC AAAACTTGCC CTGCTACCAT 7440

ACCCAGACTA CCTGAAGCAA GGGATAAGTT GGCAATCAAG TCCACCTTAA TCTGACTTGG 7500 

CAAATCTGCC TGCGCAATCA AGGCATATGA GTCTAAGAAT AAGGCATCTC CAGCCAAAAT 7560

GGCCATAGCT TCACCGAATT TCTTGTGATT GGTTAACCGC CCTCTTCGAT AATCGTCATC 7620

ATCCATAGCA GGAAGGTCAT CGTGAATCAA GCTCCCTGTA TGAATCATCT CTAAGGCAGT 7680 

AGCTACCTGC GCGTGAGCAG GTTTGATGGT AACCTGCAAG GCTTCCAGAA CTTCTAACAA 7740 

GAGAAAAGGC CGAATACGCT TGCCACCAGC ATGAATAGAA TAGAGAACAG ACTCCCGTAA 7800 

ACTAGAGGCA AACTGCTGGT CTCCATAAAA ATCTTCCAAA GCCGACTCGA CAAGAGCTAA 7860 

TTTTTCTTGC TTTTTCATTC AAAATCACTT TCTGTTCCGT CTTCTTGCAT GACCTTGACC 7920 

AAGGTCTTTT CAGCCTTGTC CAGCGTAGCT TGGAGCTCTT TTGACAAGAC CATGCCCTTT 7980 

TGAAAGGCAG TAATCGCATC TTCCAGAGCA ATTTCACCAT TTTCCAAACT TTGGACAATG 8040

GTTTCCAGTT CTGCTAGATT TTCCTCAAAT TTCTTTTGTT TTGACATCTT TAACCTCTAA 8100 

TTCTACTTGA CCATCTCGCA TCAAAAGCGT TACTTGGTCT TTTTTCTTCA AACTCTCAAC 8160 

CGAATCTACA ACGGACTCTT CTTTTTTGAC AATAGCATAA CCACGCGCCA CGATTCGGCT 8220

AGTATCCAAC ATGAGCAAAG CTTCCGAAAG TCGCTTGGCC TCAGCAACCT TGGCGTCATA 8280 

AACTAACGCC ATTTGGCTAC CTAAGAGCTT GTCCAACTGT CCTAAACGGT CTTGATAGCG 8340 

TTGGATTTTG GTAACAGGTG ATAATTGTAC TAATTGATGA GTTCTTGCTT GAACTAATTG 8400 

TTTGTTATCA GAAATCCGAG TTCGCAAACT TTGTTTCAAA CGCAGTTGCA GTTGGTCCAA 8460 

GCGTTGCAAA TAACCGTCAT ACAAGCGCTC AGGTTGTCTA AAGATAACAG ACTGACTGCA 8520 

TTTTTTCAAA GCCTCTTGTT TCTTAGATAG AACATTTCGG ACTGCCGTTA CCATCCGTTT 8580 

TTCCTGATTT TGCAAATGAG CTAATACATC CAACTTGGTC ACAGGTGTTG CCAGTTCAGC 8640 

CGCCGCTGTT GGCGTTGCAG CGCGTCGATC TGCCACAAAA TCTGCCAAGG TCACATCCGT 8700 

CTCATGCCCC ACACTAGAGA TAACTGGCAA ACGAGATTCA AAAATAGCTC GTACCACAAT 8760 

TTCTTCGTTA AAGGCCCAGA GATCCTCAAT AGAACCACCT CCACGACCAA TAATGAGCAA 8820

ATCCAAATCG TCCCGTTGAT TAGCACGCGC AATATTTCTA GCAATTTCCT CCGCAGCCCC 8880

TTCACCTTGA ACCTTGGTCG GATAAAGAAG GATGTCAACA CCTGGGAATC GCCTGCTGAC 8940

GGTCGTGATA ATATCTCGAA TAACGGCTCC ACTACGGCTG GTTACTACAC CAATTCTCTT 9000 

AGAAAATTGG GGCAGAGCTT GCTTGAAGCG TTCTTGAAAC AGGCCTTCTT CTGTCAATTT 9060 

TTTCTTAAGT TGTTCAAACT GAATCGCAAG CGCCCCAACC CCATCAGGCT CAGCTTTTTC 9120 

AATGATGATG GAGTAGCTAC CACTTGGTTC ATAGACCTGT ACACGCCCAA TCACATTGAT 9180 

CTTCATTCCT TCTTCCAGGT CAAACCCTAA TTTCTGATAA ATCCCAGACC AGATGGTCGC 9240 

TTGAATAACT GCATGGTCAT CCTTTAGGGA GAAATATTGG TGAGTAGGTC GTTTACGAAA 9300 
GTTGGAAACT TGACCAGTTA AATAGACCCG TTCCAAGTAT GGGTCTTTAT CGAATTTCAT 9360

TTTCAGATAC TTGGTCAAAG TTGTTACCGA TAAATACTTT TCCATCTCCA CCTACTATTC 9420

ATTTACTTGC TCTTTCATGG GTATTATTAT ACCAAAAATA TGCCTAAAAA TCTCCATTTA 9480 

TGTACCATTA TGAGGGAAAA ATAGAAAAAG GAGGCAAGGC CTCCACATGT GATTATTTGC 9540 

TGTTTCGAGC TTCTTCCAAA ATCTTTGCAA TCTTGGTCGT CAACAGGTCG ATAGCCACGG 9600 

TATTGCTAAC CCCTTCAGGA ATGACGATAT CAGCATAACG CTTAGTTGAC TCGATAAACT 9660 

GGTGGTACAT TGGTTTGACC ACACCTAAGT ACTGGTTAAT AACGCTATCA AGGCTACGGC 9720 

CACGCTCCTC CATATCACGC TTGATACGAC GAATAATGCG CACATCGTCA TCCGTATCCA 9780 

CAAAAATCTT GATATCCATC AAATCGCGCA GACGCTTGTC CTCCAAGACC AAAATACCCT 9840 

CAACGATAAA GACATCTTGA GGTTCCTGAC GATAGGTCTT GCTACTCCGT GTATGCTCTG 9900 

TATAGTCGTA GGTCGGGATG TCCACCGGAC GCCCTGCCAA CAATTCCTTA ATCTGCTCGA 9960

TCATCAAGTC TGTATCAAAG GCAAAAGGAT GGTCATAGTT GGTTTTGACG G 10011 
(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5365 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77:

CGTGTGGTCT TAAAAATAGA AGACAAAGAA CAAACTGTTG GAGGCTTTGT CCTTGCAGGC 60 

TCAGCCCAAG AAAAAACCAA AACAGCTCAA GTTGTGGCTA CTGGACAAGG TGTTCGTACC 120 

TTGAACGGTG ACTTGGTTGC TCCAAGTGTT AAAACTGGAG ATCGTGTCTT AGTTGAAGCC 180 

CACGCAGGTC TTGATGTCAA AGATGGCGAT GAAAAGTACA TCATCGTAGG CGACTAACAT 240

TTTGGCAATC ATTGAGGAAT AGAAGGAGAA AGTAAGTATG TCAAAAGAAA TTAAATTTTC 300 

ATCAGATGCC CGTTCAGCCA TGGTTCGTGG TGTCGATATC CTTGCAGACA CTGTTAAAGT 360 

AACCTTGGGA CCAAAAGGTC GCAATGTCGT TCTTGAAAAG TCATTCGGTT CACCCTTGAT 420 

TACCAATGAC GGTGTGACCA TTGCCAAAGA AATCGAATTG GAAGACCATT TTGAAAATAT 480

GGGTGCTAAG TTAGTATCAG AAGTAGCTTC TAAAACCAAT GATATCGCAG GTGACGGAAC 540 

TACGACTGCA ACAGTCTTGA CCCAAGCTAT CGTCCGTGAA GGAATCAAAA ACGTCACAGC 600 

AGGTGCAAAT CCAATCGGTA TTCGTCGTGG GATTGAAACA GCAGTTGCCG CAGCAGTTGA 660 

AGCTTTGAAA AACAACGCCA TCCCTGTTGC CAATAAAGAA GCTATCGCTC AAGTTGCAGC 720
CGTATCTTCT CGTTCTGAAA AAGTTGGTGA GTACATCTCT GAAGCAATGG AAAAAGTTGG 780

CAAAGACGGT GTCATCACCA TCGAAGAGTC ACGTGGTATG GAAACAGAGC TTGAAGTCGT 840 

AGAAGGAATG CAGTTTGACC GTGGTTACCT TTCACAGTAC ATGGTGACAG ATAGCGAAAA 900

AATGGTGGCT GACCTTGAAA ATCCGTACAT TTTGATTACA GACAAGAAAA TTTCCAATAT 960

CCAAGAAATC TTGCCACTTT TGGAAAGCAT TCTCCAAAGC AATCGTCCAC TCTTGATTAT 1020

TGCGGATGAT GTGGATGGCG AGGCTCTTCC AACTCTTGTT TTGAACAAGA TTCGTGGAAC 1080

CTTCAACGTA GTAGCAGTCA AGGCACCTGG TTTTGGTGAC CGTCGCAAAG CCATGCTTGA 1140 

AGATATCGCC ATCTTAACAG GCGGAACAGT TATCACAGAA GACCTTGGTC TTGAGTTGAA 1200 

AGATGCGACA ATTGAAGCTC TTGGTCAAGC AGCGAGAGTG ACCGTGGACA AAGATAGCAC 1260 

GGTTATTGTA GAAGGTGCAG GAAATCCTGA AGCGATTTCT CACCGTGTTG CGGTTATCAA 1320

GTCTCAAATC GAAACTACAA CTTCTGAATT TGACCGTGAA AAATTGCAAG AACGCTTGGC 1380 

CAAATTGTCA GGTGGTGTAG CGGTTATTAA GGTTGGAGCC GCAACTGAAA CTGAGTTGAA 1440

AGAAATGAAA CTCCGCATTG AAGATGCCCT CAACGCTACT CGTGCAGCTG TTGAAGAAGG 1500 

TATTGTTGCA GGTGGTGGAA CAGCTCTTGC CAATGTGATT CCAGCTGTTG CTACCTTGGA 1560 

ATTGACAGGA GATGAAGCAA CAGGACGTAA TATTGTTCTC CGTGCTTTGG AAGAACCCGT 1620

TCGTCAAATT GCTCACAATG CAGGATTTGA AGGATCTATC GTTATCGATC GTTTGAAAAA 1680 

TGCTGAGCTT GGTATAGGAT TTAACGCAGC AACTGGCGAG TGGGTTAACA TGATTGATCA 1740 

AGGTATCATT GATCCAGTTA AAGTGAGTCG TTCAGCCCTA CAAAATGCAG CATCTGTAGC 1800 

CAGCTTGATT TTGACAACAG AAGCAGTCGT AGCCAATAAA CCAGAACCAG TAGCCCCAGC 1860 

TCCAGCAATG GATCCAAGCA TGATGGGCGG GATGATGTAA GCTTTCTATA GAAAACAACT 1920 

TATAAAAAAC ACAAAAGGAG GGAATGACTA ACCCTTCTTT TTATAGGCTC TTTGTCAACT 1980 

GTAGTGGGTT GAAGTCAGCT AAGCTCGAGA AAGGACAAAT TTCGTCCTTT CTTTTTTGAT 2040

GTTCAAAGCG ATAAAAATCC GTTTTTTGAA GTTTTCAAAG TTTCGAAAAC CAAAGGCATT 2100 

GCGCTTGATA AGTTTGATGA GATTATTGGT CGCTTCCGGT TTGGCGTTAG AATAGTGTAG 2160 

TTGAAGGGCG TTGATAATCT TTTCTTTATC TTTGAGGAAG GTTTTAAAGA CAGTCTGAAA 2220 

AATAGGATGA ACTTGCTTAA GATTGTCCTC AATAAGTCCG AAAAATTTCT CCGGTTCCTT 2280

ATTCTGAAAG TGAAACAGCA AGAGTTGATA GAGCTGATAG TGATGTTTCA AGTCTTGTGA 2340

ATAGCTCAAA AGCTTGTCTA AAATCTCTTT ATTGGTTAAA TGCATACGAA AAGTAGGACG 2400 

ATAAAATCGC TTATCACTCA GTTTACGGCT ATCCTGTTGT ATGAGCTTCC AGTAGCGCTT 2460 
GATAGCCTTG TATTCATGGG ATTTTCGATC CAATTGGTTC ATAATTTGAA CACGCACACG 2520

ACTCATAGCA CGGCTAAGAT GTTGTACAAT GTGAAAGCGA TCCAACACGA TTTTAGCATT 2580

CGGGAGTGAA ACAGTCTGGG AGACTGTTTC AGCCTGAGCC TAGAAATTTG AAAGCGAAGC 2640

TGTTTAGCCA AGTCATAGTA AGGACTAAAC ATATCCATCG TAATGATTTT CACTTGACAA 2700 

CGAACGGCTC TATCGTAGCG AAGAAAGTGA TTTCGGATGA CAGCTTGTGT TCTGCCTTCA 2760 

AGAACAGTGA TAATATTAAG ATTATCAAAA TCTTGCGCAA TGAAACTCAT CTTTCCCTTA 2820 

GTGAAGGCAT ACTCATCCCA AGACATAATC TTTGGAAGCC GAGAAAAATC ATGCTCAAAG 2880

TGAAAGTCAT TGAGCTTGCG AATGACAGTT GAAGTTGAAA TGGCCAGCTG ATGGGCAATA 2940

TCAGTCATAG AAATTTTTTC AATTAACTTT TGAGCAATCT TTTGGTTGAT GATACGAGGG 3000 

ATTTGGTGAT TTTTCTTTAC CAGGGGAGTC TCAGCAACCA TCATTTTTGA ACAGTGATAG 3060

CACTTGAAAC GACGCTTTCT AAGGAGAATT CTAGAAGGCA TACCAGTCGT TTCAAGATAA 3120

GGAATTTTAG AAGGTTTTTG AAAGTCATAT TTCTTCAATT GGTTTCCGCA CTCAGGGCAA 3180 

GATGGGGCGT CGTAGTCCAG TTTGGCGATG ATTTCCTTGT GTGTATCCTT ATTGATGATG 3240

TCTAAAATCT GGATATTAGG GTCTTTAATA TCGAGCAGTT TTGTGATAAA ATGTAATTGT 3300

TCCATATGAA TCTTTCTAAT GAGTTGTTTT GTCGCTTTTC ATTATAGGTC ATATGGGACT 3360

TTTTTTCTAC AACAAAATAG GCTCCATAAT ATCTATAAGG GATTTACCCA CTACAAATAT 3420 

TATAGAGCCG AAAATTCACA TCTAATATAT GCAGACTACT TTGAAATGAA ATTAAAAAAA 3480

TTATTAAAGG ATGACACAAA AGTTTTTGAA AAATCTACAT TCAAATTTGT AGAAGGATAT 3540

AAAATATACC TGACAGAATC TAAAGAATCT GGAATTAAAC AAATGGACAA TGTCATAAAA 3600

TATTTTGAGT TTATTGAATC TAAAAGTATT GCTTTATATT TTCAAAAACG ATTAAATGAG 3660

CTGATAGATT AAATAGCATT TTCTCTGTTG AGATATTGTT TTTAAAATAT TGTACTAAAT 3720

GATTGATGCT ATGTGGAAAT ACAAAAAAAT GTTTTTGATA CGAAGTTGAC CTGTATTTTT 3780 

TATACTAATC ATTTTCGTAT TTTTTGTATT AAACGATATA AGTTTGTTGT AAACTTACAA 3840

GGAATAAAGA CATTAAAAAA TAACAGTATA TCTATTTGTT TTATATATTT TACGAATTCT 3900 

GCATAAATCT CTTTCTAGTA ATGTGTTGTA ACTCTGCTAT AATAGATTTA TTCCTTTTTG 3960

TGTTTACACA ATTTATTTTA TAGTACCAAA AAAGGTCAGG ATTTTGTTCC TGACCTTTGA 4020

CAACTTTACC GATTCTTTAG TTCTACATAG CGCTTGTACC AAATGTTTAC ATAGGCTTCT 4080 

GAGAAAGGAC CACGTCCATT GTTAATCCAA TCAACAAGAA TTTTGACATG TTCTTTTAAA 4140 

ATATAGTCCA AGTCATCAGA ATAATTCATT TTGCGTTTGT GACGCTCGTA CTCTTCAACG 4200

TCCAAGAGAC GTTTTTCCCC ATCTGTAAAA ATTTTAACAT CCAAATCGTA ATCAATATAC 4260
TTCAGTGCTT CTTCATCCAG ATAGTAGGGG CTAGCCATAT TGCAATAGTA AGAAGTTCCA 4320

TTATCACGAA TCATGGCAAT GATATTAAAC CAATATTTCT TGTGAAAGTA AACAATAGCC 4380

GGTTCTCGAG TGACCCAACG ACGACCATCA CTTTCGGTAA CAAGTGTATG ATCGTTGACA 4440 

CCAATAATGG CGTTTTCTGT TGTTTTTAGT ACCATGGTGT CCCGCCAAGT TCGGTGGAGA 4500 

CTCCCATCAT GCTTATAACT TTGAATTGTA ATAAAGTCGC CTTCTTTTGG AAGCTTCATA 4560 

ACTAACCAAC TTTCTACAAT TTATAAGTTT ATCATTTACT ATTGTACCAT AAAATTACCC 4620

AAAATCTGTG AATTTCACTT GGAAATATTA AAGATATTCT CTAAGAGCGC TTGCTATATC 4680 

CGAAAAATCG TAGCCCTTTC GTGCTAAAAC TTGAGTTAAA CGCTGCTTCA GTTCGTATCC 4740

TTCATACTTT CGGGCATACT TAGTATATTG CTTATCAAGT TCCTTGAAGA TGAGTTCCTG 4800

AGTCGTTTCT TCATCAACTT GACTATCCAA TTCGTCAAAG GCAATTTTAG CATCAAAATA 4860 

AGAGAAGCCC TTGTTAGTCA AGTTCTGGAT AATCTTATCT TGCAGGGCAC GAGCTGGAAG 4920

TTTTCCCTCA TATTTTTTCA ATAGTTTATT GGCTACACGT TGAGCAACTT CCGAAAAATC 4980

AAAATCATTC AAGATTTCTT CTATAGTAGA TTTTGAAATT CCTTTTTGTG CTAATTTCTG 5040

AGTCAGTACA TAAGGTCCCT TGTCTCCTGA AAGTTGATTG GCATTGATGA TAGCATAAGC 5100 

GTACTGGCTA TCATTAATCC ACTTCTCTTC TTTAAGATTA GCAATGACTT GAGAAACGAT 5160 

GTTTTCATTA ATATCATATT TTTTCAGATA TTCTCTGACC TCTTTTTCAG TACGTGCTTT 5220 

AAAGGATAAG TGGTAGAGGG CCAGATTCTT ACCATAAGAA AATTGAGCAA AGTCTTGAAT 5280 

CTCTTTCAAT TCCTCTTCGC TTATCACCTT ATCTCTCGAT AACATAAAAC GAACAATTGT 5340 

GTCTTCGGTG ATATAGCATT TGTCG 5365 
(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3636 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78:

TTTCCAGAAA GAAGTTGAGT AAAGTCTTTA TCAAAGAGAA TGACTTCCGT ATTGGAACTG 60 

ACATTAGGTT TTATTTCTAC TTTACTAGCG TCCGCCCTAG CATTTTCTAA ATCTTTAATC 120 

TCTTCTGTTG CCCTATTTAT AGCCAGCTGA ATAACTGCTT GAGGATTTTC ACTCAGTCCA 180 

TGAAGCTTAT CGTCCACCGA AGTATAAAGA CTCGAATGCA TGACTTGTAA AATAATCAGA 240 
GTCATTGTAG AAAAAATCAG GGTGAAGACA CCGAAGTTGC GGATAAAATA ACTAAAGTCA 300

TCCGCATACC ATGTTTTTTT AAGTTTACTG AACATCTTTT AAAAGATACC CAACACTACG 360 

CAAAGTTTGC AAATTCTCTG CAAAAGTGGT TCCCTTTAAT TTCTTACGGA CTTTTGAAAC 420

ATAGACTTCG ACAACCGAAA TCGTTGTATC ACTATCAAAT CCCCATAGAC GGTCAAAAAT 480

CTGCGTCTTA GGCAAAATCA CATTTTGATT TTGAAGGAAA TAAACTAGTA AATCGAACTC 540

TTTCCCCAGC AATTCGACAG GAGTATCTTC AACTTTAACG GTATTGGTTG ATAAATTAAC 600 

CACGATATTC CCATAAGTCA AGGTGTTTTC ATTAAACTTC CCTGAACGTT TGAGAAGGGC 660 

CTGAATCCGC ATTTTAAGTT CTTCTAGGTA GAAAGGTTTG GTCAGATAAT CATCCGCTCC 720 

CAGTTCAAAT CCATGTCCCT TGTCATCCAA ACTTTCCTTG GCAGTCATAA TCAGAACTGG 780 

TGTCGTAATT CCCTTTTCAC GCAATTCTTT TAAGACTTGG AAACCATTTT TTTCTGGCAA 840 

CATCAAATCC AGCAAAATCA AGTCATAGAC ACCACTCTCA GCTTCGTAGA GACCTTCTTC 900 

TCCATCAAAT ACCTGCATAA CATCCGCAAA ATCGTCTAAA AAGTCAAATA CTGAATTTGA 960 

CAGACCTAGG TCATCCTCAA CCAATAAGAT TTTTATCATG AGAAACTCCT CCTTATTAAA 1020 

ACTATTATAC CAAATTTGCC TTAAAAAAAA CTCAACTCTC TGCATTTTAC ATGAGATAGC 1080 

TGAGTTTTCT TTTTATTTTA GGCTTATTTA TGCATTTCCG TATTGAAGAA CAACTGCTTC 1140 

GACTGCAGCT TTTTCACGGC TAATCAAGTC AACACGCGCT GCAATTTCCT TGATTCCCAT 1200 

ACCGATGTTA CGGCTAAGAG CAAGGTCAGA AAGTTGCGGT TCAAAGAACT CCTTGTATTC 1260 

CGCCAAGCGT TGCTGAGTCT TAAATACATG AGCAGGAAGG ATAACAAAGC TATCAAAGCT 1320 

CATATCTCCT CCAAGGGCTG CCTTAATCCA AGCCCAGTTT TCACGCGCCC AAGACCAAGC 1380 

TGTTTTCTGA GTTGCTTGAT GAGCTAGGAA TTGGTAATAC CAAGCAGACA AGTCCTGTGG 1440 

TTTGACCACA AATTTGTCCT TCCAAGAAGT AATCAGGTTT TGGATATTAT CCGCATCTGT 1500 

ACTGTATGCA AGAGCTGCTG CCAACTGGCG TTTAAAGACA GCATCTGTTG CGTGAGTATA 1560 

AGTATCAAGA TAAAGTGCTA ACAAGTCTTT AGTCTCATGA TGTTTCATCT CATTAATCAG 1620 

AACTTGTGAG CGAATAGCTG CTGGGAGTCC TGCAAGATTC TCCTTGTGTG TTGCGAAGAT 1680 

TTGGCTAGCG ACTTGACTAG CTTCTGCATC ATTTGAGCGA ATCATCATCG AAACAGCCAG 1740 

CTGACGAACC AATTCATCCT CATCTGATTC TCCGTCTTTA GCTTCAAAAC CAAGACGGTC 1800 

ATAGTTATGA CGAGCCAATT TAGCAACCAG TCCTTTGAAG GCTGTTTCAG CATCCGTTCC 1860 

TTCATCAATA AAGCGCTCAA GGGCTGAAAT CACTTGAGAA ACAGCTGAAA CCACCAGATA 1920 

AGACTCTTCC TTAGCAAGTT TATCAAGAAC TGGAAGCAAG TCTGCATAAG AAATGTGCCC 1980

TGCCTCAGCC AACAAACGAC GTTCTTGAAC AATTTGCAGT TTGCTTGTGT TATCAAGTGT 2040 
CTCTAGCTCA GCAAGAACAG CTGCTAACAA GTCTCCTTGA TAGTCGGTAA TATAGTGGGC 2100

AGTATTTTCA GTGTTGAGAC GAAGAGCTCC TTCATTTTCA GCAAGAAGAG CTGCGTAGCC 2160 

AGGGATTTCG ATACTTTCAG TTTCGAGTGT ATCAGGCAAG CCTTTCCAGT TGCTATTGAG 2220

GGGCACCACC CAGAGACGGT TCTTGTCTTC GTTCTCACCG ATGAAGAATT GTTTTTGTGA 2280 

AATCTTCAAG ACATCATTTT CAACTTTAAC AGTAAGAACT GGGTAACCAG GCTGTTCCAA 2340

CCAAGAATCC ATGAAGGCTG CGACATCACG TCCTGACGCT TGACCAAGGG CATCCCAAAG 2400

GTCACTACCA ATGGTGTTGC TGTATTGGTG TTTTTCAAAG TAGGCGTGCA AACCTTTAGC 2460

AAAATCAGCA TCTCCTAGCC AACGGCGAAG CATGTGCATG AGACGGCTTC CTTTGGCATA 2520

GACGATAGCG CCGTCAAAGA GTGTATTGAT TTCATCTGGA TGTTTAACTT CGACGTGGAC 2580

AGACTGAACG CCATCAGTAG CGTCACGTTC AAGAGCAAGA GGTACTCCAC CTGTTTGGAA 2640

ATCTTCAAAG ATATTCCAGC TTGGTTCGAT GGTATCCACA CAGACGTATT CCATCATATT 2700

AGCGAAACTT TCATTGAGCC AAAGGTCATC CCACCATTTC ATAGTCACGA GGTTCCCAAA 2760

CCATTGGTGA GCCAATTCAT GGGCCACAAC AAGGGCAACT TGTTGACGGC TAGCAAATGT 2820

AGAGTTCTCA TCGACAACCA AGTAAACTTC ACGGTAGGTC ACAAGACCCC AGTTTTCCAT 2880

AGCACCAGCT GAGAAGTCAG GAAGGGCGAT GTGGAGAGAT TGAGGAATTG GGTACTTAAC 2940

TCCATAGTAA TCTTCGTAAA ACTCGATAGA GCGAACAGCG ATATCCAGTG AGAAATCAAG 3000

ATTTGAAAGT GGATGTGCTT TGGTTGAGTA GACACCTACC AGGGTACCAT TTTTAGTTTT 3060

AGCGGTCACC CCTTGCAAAT CACCAGCAAC AAAGGCCAAC AAGTAAGAAG ACATGCGAGG 3120

TGTTGTCTCA AACTTCCAGA TACCTGTTTC CTTACGGTTT TCAACATCGA TTTCTGGCAT 3180 

GTTTGACAAG GCCAATTCAC CTTCTGCTTG GTCAAAGCGA AGAGAGAGGT CAAAAGTTGC 3240

TTTGGCTTCA GGCTCATCCA CACATGGGAA AGCTTCGCGC GCAAAATGGC TCTCGAACTG 3300

AGTAGACAAG ACCTCCTTCT TGACTCCATC AACTGTATAA TAAGAAGGGT AAATCCCTGT 3360

CATGTTGTCT GTAATTTTAC CAGAAAAGGC AAGAACCAAT TCAACTTGAC CAGCCTCAGC 3420

CAATTCGATA TGAAGGGCTT CATTGTCATG GTCAACTGTA AATGGACGAG CTTGACCTGC 3480

AACTTCTACA GAGGTGATTT CCAAATCTTT TTGGTGGAGG GAGATGCGGT CACTCTGTGC 3540

TTGACCAGTG ATGGTCACTT TCCCAGAAAA AGTCTTGGTC TCACGACTCA AATCTAAAAA 3600

TAAATCATAA TGTTCAGGAA CAAATTGCTT AATGGG 3636

(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5066 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

ATAGCGTGTA ATAATCGATT TTAGAGGTAC CATAAGCCAC CTCCTACAAA TAGAAACCGA 60 

TATAAATCAA TGCCTTCCAC CCTTAGACTT CCCTAGTTCC TGTCTCAAGC GAAACATTTC 120

TTTGAAACAG GAATAAGTTA ACCAATTCAT ACCAATAGCT AGCAGAATAA AAAGAAACCA 180

AATGCCCCAT AACTTGATAT CTGTCACATT TCTCAAGACG GTATTGAAAA ACAGAACTGA 240

AACAACTGTC CAAGCAAGGC TAAAAAGAGA ATAGAAGGGG ATGTAAAACC AGTAAAAATA 300 

ATAAAAAATT GGAAAAAACT TACTATTTCT GTTGGCCTTT TCAATCCAGT TATCAAAATA 360

AAAGTACGGT GCTAAAAGTA AGAATTTAAA CAAATGTTCC ATCACCGACA TCCCCCCTTC 420

TTTTGATAGC GTTTTCTATT ATTTTATTAT ATCAAAAAAA TCCGGAACTG TCATTCCAGA 480

TTCTACTTTT TTATTTGCGT TTTCTTGCGA TGAGATGAAT CGGTGTTCCC TCAAAAACAA 540 

AGGCCTTGCG GATTTGATTT TCCAAGAAAC GCAGGTAAGA AAAGTGCATG AGTTCTTCTT 600 

CATTGACAAA GATGACAAAG GTTGGTGGTT TGGTTGCCAC TTGGGTCGCA TAGAAAATCT 660 

TGAGACGTTT TCCTTTGTCT GTCGGTGTTG GGTTGATGGC AATGGCATCC ATGATGACAT 720

CGTTCAAGAC AGCTGATGGA ATACGTGTAT TTTGACTTTC GCTGATTTGC TTAATCATCT 780

CAGGAAGTTT GTGGAGACGT TGCTTGGTTA AAGCTGATAC AAAGATAATC GGTGCGTAAG 840 

GCAGGTATTG GAACTGCTCA CGGATATCTT CTTCCCAGTT TTTCATAGTG TGGTTATCTT 900 

TTTCAAGCGT ATCCCACTTG TTGACCACGA TAATCATCCC TTTACCAGCT TCATGGGCAA 960 

ATCCTGCGAT ACGCTTGTCG TACTCACGAA TGCCTTCTTC CGCATTGATG ACCATCAAGA 1020

CCACATCTGA ACGGTCAATA GCACGCATGG CACGCATAAC AGAGTATTTC TCAGTATTTT 1080

CATAAACCTT ACCAGACTTA CGCATACCAG CCGTATCAAT CATGGTAAAC TCTTGACCAT 1140

CTGTATCTGT AAAGTGGGTA TCAATGGCAT CACGAGTTGT TCCAGCAACA GGACTAGCAA 1200

TAACACGGTC TTCTCCCAAG ATAGCATTGA TCAAGCTTGA TTTTCCAACG TTAGGACGAC 1260

CAATCAAGCT AAACTTAATG ACATCTGGAT TTTCTTCCTC ATATTCATTT GGAAGATTTT 1320

CTACGATCGC ATCTAGCACA TCCCCTGTAC CGATTCCATG GACAGATGAG ATAGGCAATG 1380

GTTCACCCAA ACCGAGAGCA TAGAAATCAT ATATATCATT TCTCATCTCA GGGTTGTCCA 1440 

CCTTGTTGAC TGCGAGGATA ACTGGTTTGT GGGTCTTATA AAGCTTACGA GCTACGTATT 1500 

CGTCTGCATC AGTAATTCCT TCCTTACCAG ACACGACAAA AACGATAACA TCTGCTTCTT 1560 
CCATGGCAAT TTCTGCCTGG TGCTTGATTT GTTCCATGAA AGGAGCATCG ACATCATCAA 1620

TTCCTCCTGT ATCAATCATG CTAAAAGAAC GATTGAGCCA CTCACCCGTT GCATAAATAC 1680 

GGTCACGTGT CACTCCTTCG ACATCTTCTA CAATGGAGAT TCGCTCACCA GCGATCCGAT 1740 

TAAATAGGGT TGATTTCCCA ACATTGGGAC GTCCTACAAT GGCAATAGTT GGTAGGGCCA 1800 

TAATTTCTCA CTTTCTACAA TAATTTCTTC TGTTCAAGAT TTTTTCTAGT TGAGCTTGGT 1860

TCAGCTTGAC CAAACTGTTC TGCTAGGCGC TGACTCCAGC TTGTGGTCGC ACGCGCCCCA 1920 

GCATAGTCAG CCTGAACACG GTCATAAGCT TGGATTGCCT CAGTTGACTG TTCTTGGTAT 1980 

TCTTCCTCAA AGACAACATT CTCTAGTGGC AGTCTCGGTT TCATATCATG ATGTTGATTT 2040

GGCACACCCA GTGCCATCCC AAAGACAGAA TAGGTGTAGT CAGGTAGGTT AAAGAGCTCT 2100 

GCCACTTCTT CAGACTTGTA TCGAACCAAA CCGATAATCA CACCACCATA GCCCAAGCTT 2160 

TCAGCTGCCA ACAAGGCGTT TTGTCCAGCA AGAGCTGCAT CGACCGAACT AATCAAGAGA 2220 

CCTTCCACAC CTTGGGGTTG GAAGGTGTCG GTATGAAGTC GGGCTCCCTT TTCTGCTCGG 2280

TTCAAATCTC CGACAAAGAG AAGGAAAACA GCAGACTGGC GAATGGCTTC TTGAGGTACC 2340 

AATTCATACA AGGCATCTTT CTTCTCTTGA CTTCGTACCA CAATCACAGA GTAGGATTGG 2400

AAATTCTTCC AAGATGATGC CATCTGGGCT GCTGTCAAAA TCTCATTTAA GTCTACTTGG 2460

GGAATTTCTT GCTCTTTAAA CCTGCGCACT GAAGTATGAG CCTTCATCAA TTTAATGGTT 2520

TCTGTCATCG ACGGTTTACT CCTTCTAAAC GAGTCTCCTC AGCCAAATAA CGGATGCGTT 2580 

CCATGACCCG TCTGGCTTCC CAGGTTTCGT CATTTCCATG TTTCACTTTC GCAAAATGCT 2640

TCTCCAAATC TTCAAAGTTG AAGTTGGATG TGAAAAAGGT CGGTAAATTT TCCTGCATCC 2700

GATATTGGAG AATGACCTGC AGGATTTCGT CACGCACCCA AACGGTTGAT TGCTCGGCGC 2760

CAATATCATC TAAAATCAGG ACCTCAGACA GCTTAATCTC ATCCACCAAG GTCTTAACAT 2820

TGCCATCACT GATAGCATTT TTGACATCAA TGACAAAGCT AGGATAGTGG AGGAGAGTTG 2880 

ATGAAACACC ACGTTTTTCT GATAAATCAT GAGCTAAGGC CGCCACCATG AAACTTTTAC 2940

CCACACCAAA GTCTCCATAT AAGTAAAGAC CTTTTCGAAT AGCTGGATAT TGCTCCACGA 3000 

AGGCTAGTAG CTTTTCAAAA ACTGGTAAGC GCCCCAAATC ATCCAAGTCA ACTTGAGCCA 3060

AACTAGCTTT CTTGAGACTG GCTGGTAGAT TGATTAACTT GAGACGGTTC TTAATAGCCG 3120

CTTCTTTTTC AGCCGCGATT AGCTCAGGAG TTTCTTCATA TGAAACATCT GCATAACCAT 3180 

GATTCTTAAC CAAAATCGGC TTGTAGCCTT TGGCAATATA ATCCGTATCC CCACGGAGAA 3240

ACTTGTCACG CTCGGTGATG TACTGATTAA ACTTGGAGAT ACTGCGATTT AATTCCTTTG 3300 
GAGTTAAGGA TTCTTGCTGG ATAAAGGCCG CAACATCAGG GTCCTTCATG ATTTTCTGGA 3360

CCAAATCTTG ATAATAAAAA CGGCTGGGTT GACGTTTGAG TACGTCTCCG ACACTTTCCA 3420

TCTAATCTCC TCCTTTTTCT AATCGAGCTA ATAGTTCTTG CTTCTTACGT TCTAGTTCCA 3480 

GACGAGTTTC CTCGCTGGTT TCATTCTTAT ATTCAGGATT ACTCCATTTA GGAACATTGG 3540

TTTTTTCTGG GGCAGTCTGA TTCTGTTTTT GTGTTTTTGC TTTCTGCCCT CGATCACGAA 3600 

TTCGTAAAAC GGCCTCTTCT GCCGAATGAA TCTTTTGATA GGCATAGTCA TTGGCTACCT 3660

TCATGGCATA TTTCTCATTG ATATTTGCCG AATCCACCTT ATTAAAGGTC AATAAGAGAA 3720 

TAATATTGAT GACTTCGTCC AGTAAGCCCA AGCCAGCCAT CTGTTGCAAG AGTTCTCTTT 3780

CTGTTTGGGT AATGGTTCCC TTGCGTGTTT GCTTGATTTC TGCTAAGAAC TGCAGGGCAG 3840 

TTTTACTTTT AGCTTCTTTG ATAATGGTCG CTTCCTTAAG ACTAAAGTCA GAGGAAACTG 3900

GTTTTTGAGC AATTTTTTCA CGCATGCGTT TGGTTGAAAT AACCTGGGAA ACAGCTGTTG 3960

ACTTGGCCAA TTGATAGGTT TCAAACCAAG TCCATTTCTT CTCCTCGGCA ATAGCAAAGA 4020 

GGTTTAAGAC ATCGGACTGC TCATCCGCAA AACGAAGTCC ATCTCGAGCC ATCAGCTGGC 4080

GAAAATGTTC CAAGTCAAAA TCATTGGCCA CTTTCTTCTT GAGACCAAGG TCTTCTTGAC 4140 

TGCCTAGTTC TGCCAATTCT GGAAAGACTT GATTGAGTGA GACAGGTATT TCTTCACCAT 4200

CAGCACTTTC AACTTTCAAA TCCTCCACAG CTACATCGCC AATCTTTTTC TCTAAGAGTC 4260

TGCGATAAAC AGGATGCCCC AAGAAGTCTT GACTAGATAG AGGAGCATGG AGGGCTAGCT 4320 

GATAAACATC ACCCTTTTGA TAGAGGGTCA AGAGATTAAA AGCAGATAAG ATTTTCAATG 4380 

ATTTTATCAG TCTATCCATC CCAAAGTTGA GATGGTTGAG AATGCTTGAA AAAAGATATT 4440 

CCTTTCTACC ATTATCCCAA AAACTGATTG TATAAAGATA AAGGCTCAGT GCCTCCTGAC 4500 

CGATAATCGG GAGGTAGCAC TGTACCAGAG ATGAGGTATC TTGCGACACC CGATTATTCT 4560

TTAGATAAGA AAAACGGTCA ATTGGCTTCA TTTATCTTTC CTTTTTCTTT TTAGAGGACT 4620 

GGGTGATTTG TTGGAGCAAG CTCTCTAACT CACTGACATC CTTAAAACTA CGATAGACAC 4680

TAGCAAAACG TACATAGGTA ATCTCGTCCA ATTCAGCCAA CTCCTCCATG ACGAGTGAAC 4740

CAATGTCCTC ACTTTGAATT TCATTTTCAT TTCGACCACG GAGTTTCTGT TCGATACGAT 4800

TGACTACCAT GTTGATTTCA TCACTTGACA CAGGACGTTT CTGGGCTGAG CGGATAATCC 4860

CATTAAAGAT TTTATCTCTG GAGAATTGTT CCCGTGTGCC ATCTTTTTTA ACAACCACTA 4920

AGGTTCTTTC TTCTACTCGT TCGTAGGTTG TAAAACGGTG TTGGCATTCG TCGCACTCAC 4980 

GTCTTCTACG AATGGTGTTC CCTTCTTCTG CTTGGCGACT ATCGATAACA CTTGACTTGG 5040 

TAGCCCCACA TTTTGGACAG GGTACC 5066 
(2) INFORMATION FOR SEQ ID NO: 80:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 

CACTTGAAGT ATTTGAAACA GCTATGGAAA ACATCATGCC TGTACTTGAA GTACGTGCAC 60 

GTCGTGTTGG TGGTTCTAAC TACCAAGTCC CAGTTGAAGT TCGTCCAGAA CGTCGTACAA 120 

CACTTGGACT TCGTTGGTTG GTAACAATCG CTCGTCTTCG TGGTGAACAC ACAATGCAAG 180

ACCGTCTTGC AAAAGAAATC TTGGATGCTG CTAACAACAC TGGTGCAGCA GTTAAGAAAC 240

GTGAAGATAC TCACCGTATG GCTGAAGCTA ACCCTGCATT CGCACACTTC CGTTGGTAAG 300

ATAGGATGCG AAAGCGTTAA GAAAGTCCCA GAGAAAATAG GGAATCGAAG CAGGTTGCGG 360

TTGCAACCAA TGAGATTCAT CTTTTTCTCC AGACTTTTAG CTTGAGCTCA ACTAAATCAT 420

GATGCTAGGA ACGGTAAGGA TGCAAGGTAA AAATAGGAAA CTGACGCAGT ATTCGACGAA 480 

TACAAGGAGT TTTATCTTTT TCACGCAGCA TCCCGTTCCA GCTCACATCG GCTAACTAAC 540 

TTTAGCCCGG GTTCAAATTA GCTAAATCGA TTAGTATTAG CTATAACTCA GCTTACCATC 600 

TCGTAAGTTG AAACCAACAA TAGCATGAAA ACATTGAGAA CGGGTAGGTC CTGCCTATCC 660 

GTTTTTATTA AAATCGTGTT ATAATAGAAT AGAAATCAAA AATAAATAGG AGAAACAAAC 720

CTCATGGCAC GCGAATTTTC ACTTGAAAAA ACTCGTAATA TCGGTATCAT GGCTCACGTC 780 

GATGCCGGTA AAACAACAAC TACTGAGCGT ATTCTTTACT ACACTGGTAA AATCCACAAA 840

ATCGGTGAAA CTCACGAAGG TGCGTCACAA ATGGACTGGA TGGAGCAAGA GCAAGAACGT 900 

GGTATCACGA TCACATCTGC TGCGACGACA GCTCAATGGA ACAACCACCG CGTAAACATC 960

ATCGACACAC CAGGACACGT GGACTTCACA ATCGAAGTAC AACGTTCTCT TCGTGTATTG 1020

GATGGTGCGG TTACCGTTCT TGACTCACAA TCAGGTGTTG AGCCTCAAAC TGAAACAGTT 1080

TGGCGTCAAG CAACTGAGTA CGGAGTTCCA CGTATCGTAT TTGCCAACAA AATGGACAAA 1140 

ATCGGTGCTG ACTTCCTTTA CTCTGTAAGC ACACTTCACG ATCGTCTTCA AGCAAATGCA 1200

CACCCAATCC AATTGCCAAT CGGTTCTGAA GATGACTTCC GTGGTATCAT TGACTTGATC 1260 

AAGATGAAAG CTGAAATCTA TACTAACGAC CTTGGTACGG ATATCCTTGA AGAAGACATC 1320

CCAGCTGAAT ACCTTGACCA AGCTCAAGAA TACCGTGAAA AATTGATTGA AGCAGTTGCT 1380 
GAAACTGACG AAGAATTGAT GATGAAATAC CTCGAAGGTG AAGAAATCAC TAACGAAGAA 1440

TTGAAAGCTG GTATCCGTAA AGCGACTATC AACGTTGAAT TCTTCCCAGT ATTGTGTGGT 1500

TCAGCCTTCA AAAACAAAGG TGTTCAATTG ATGCTTGATG CGGTTATCGA CTACCTTCCA 1560 

AGCCCACTTG ACATCCCAGC AATCAAAGGT ATTAACCCAG ATACAGACGC TGAAGAAATT 1620 

CGTCCAGCAT CTGACGAAGA GCCATTTGCA GCTCTTGCCT TCAAGATCAT GACTGACCCA 1680 

TTCGTAGGTC GTTTGACATT CTTCCGTGTT TACTCAGGTG TTCTTCAATC AGGTTCATAC 1740

GTATTGAATA CTTCTAAAGG TAAACGTGAA CGTATCGGAC GTATCCTTCA AATGCACGCT 1800 

AACAGCCGTC AAGAAATCGA CACTGTTTAC TCAGGTGATA TCGCTGCTGC CGTTGGTTTG 1860 

AAAGATACTA CAACTGGTGA CTCATTGACA GATGAAAAAG CTAAAATCAT CCTTGAGTCA 1920

ATCAACGTTC CAGAACCAGT TATCCAATTG ATGGTTGAGC CAAAATCTAA AGCTGACCAA 1980 

GACAAGATGG GTATCGCCCT TCAAAAATTG GCTGAAGAAG ATCCAACATT CCGCGTTGAA 2040

ACAAACGTTG AAACTGGTGA AACAGTTATC TCAGGTATGG GTGAACTTCA CCTTGACGTC 2100 

CTTGTTGATC GTATGCGTCG TCAGTTCAAA GTTGAAGCGA ACGTAGGTGC TCCTCAAGTA 2160 

TCTTACCGTG AAACATTCCG CGCTTCTACT CAAGCACGTG GATTCTTCAA ACGTCAGTCT 2220

GGTGGTAAAG GTCAATTCGG TGATGTATGG ATTGAATTTA CTCCAAACGA AGAAGGTAAA 2280

GGATTCGAAT TCGAAAACGC AATCGTCGGT GGTGTGGTTC CTCGTGAATT TATCCCAGCG 2340 

GTTGAAAAAG GTTTGGTAGA ATCTATGGCT AACGGTGTTC TTGCAGGTTA CCCAATGGTT 2400

GACGTTAAAG CTAAGCTTTA TGATGGTTCA TATCACGATG TCGACTCATC TGAAACTGCC 2460

TTCAAGATTG CGGCTTCACT TTCCCTTAAA GAAGCTGCTA AATCAGCACA ACCAGCTATC 2520

CTTGAACCAA TGATGCTTGT AACAATCACT GTTCCAGAAG AAAACCTTGG TGATGTTATG 2580 

GGTCACGTAA CTGCTCGTCG TGGACGTGTA GATCGTATGG AAGCACACGG TAACAGCCAA 2640

ATCGTTCGTG CTTACGTTCC ACTTGCTGAA ATGTTCGGTT ACGCAACAGT TCTTCGTTCT 2700

GCATCTCAAG GACGTGGTAC ATTCATGATG GTATTTGACC ACTACGAAGA TGTACCTAAG 2760

TCAGTACAAG AAGAAATTAT TAAGAAAAAT AAAGGTGAAG ACTAATCCGT CCTCACTCTA 2820

GAAGGAAGTC ACTTAGTGGC TTCCTTTTGT CTTTAGAAAA TACCTCTAAA TATGGTAAAA 2880 

TAGTAGAAGA ATAATGTGAG GAAAATGAAT GTCAAATAGT TTTGAAATTT TGATGAATCA 2940

ATTGGGGATG CCTGCTGAAA TGAGACAGGC TCCTGCTTTA GCACAGGCCA ATATTGAGCG 3000 

AGTTGTGGTT CATAAAATTA GTAAGGTATG GGAGTTTCAT TTCGTATTTT CTAATATTTT 3060 

ACCGATTGAA ATCTTTTTAG AATTAAAGAA AGGTTTGAGC GAAGAATTTT CTAAGACAGG 3120

CAATAAAGCT GTTTTTGAAA TTAAGGCTCG GTCTCAAGAA TTTTCAAATC AGCTCTTGCA 3180 
GTCCTACTAT AGGGAGGCTT TCTCTGAAGG TCCATGTGCT AGTCAAGGTT TTAAGTCCCT 3240

TTATCAAAAT TTGCAAGTTC GTGCTGAGGG TAATCAGCTA TTTATTGAAG GATCTGAAGC 3300 

GATTGATAAG GAACATTTTA AGAAGAATCA TCTTCCTAAT TTAGCCAAAC AACTTGAAAA 3360

GTTTGGTTTT CCAACTTTTA ACTGTCAAGT CGAGAAGAAT GATGTCCTGA CCCAAGAGCA 3420

GGAAGAGGCC TTTCATGCTG AAAATGAGCA GATTGTTCAA GCTGCCAATG AGGAAGCGCT 3480 

CCGTGCTATG GAACAACTGG AGCAGATGGC ACCTCCTCCA GCGGAAGAGA AACCAGCCTT 3540 

TGATTTTCAA GCGAAAAAAG CTGCAGCTAA ACCCAAGCTG GATAAGGCGG AGATTACTCC 3600

TATGATCGAA GTGACGACAG AGGAAAATCG TCTGGTATTT GAAGGGGTTG TTTTTGATGT 3660

GGAGCAAAAA GTGACTAGAA CAGGTCGTGT TTTAATCAAC TTTAAAATGA CGGACTATAC 3720 

TTCAAGTTTT TCTATGCAAA AGTGGGTTAA AAACGAGGAA GAGGCCCAGA AGTTTGACCT 3780 

CATCAAGAAG AATTCTTGGC TCCGAGTTCG AGGGAATGTG GAGATGAATA ACTTCACACG 3840

CGATTTGACT ATGAACGTAC AGGATCTGCA GGAAGTTGTT CACTATGAGC GGAAGGATTT 3900

GATGCCAGAA GGTGAGCGTC GGGTTGAGTT TCATGCTCAT ACTAACATGT CGACTATGGA 3960

TGCTTTGCCA GAGGTCGAAG AGATTGTTGC AACAGCTGCT AAGTGGGGAC ACAAGGCGGT 4020 

TGCTATCACG GACCATGGGA ATGTCCAGTC CTTTCCACAT GGCTATAAGG CGGCTAAGAA 4080 

AGCGGGAATC CAGCTGATCT ATGGGATGGA AGCCAATATC GTGGAGGACC GTGTCCCTAT 4140 

CGTCTATAAC GAAGTGGAGA TGGACTTGTC AGAAGCAACC TACGTGGTCT TTGACGTGGA 4200 

AACGACGGGA CTTTCAGCTA TCTATAATGA CTTGATTCAG GTTGCGGCTT CTAAGATGTA 4260

CAAGGGGAAT GTTATTGCTG AATTTGATGA ATTTATCAAT CCTGGGCATC CCTTGTCAGC 4320 

CTTTACTACA GAGTTAACTG GAATTACAGA TGATCATGTC AAAAATGCCA AACCACTAGA 4380 

ACAAGTTTTG CAAGAATTCC AAGAATTTTG CAAGGATACG GTCCTAGTTG CCCACAATGC 4440

TACCTTTGAC GTTGGCTTTA TGAATGCTAA TTATGAGCGG CATGATCTTC CAAAGATTAG 4500 

TCAGCCAGTT ATTGATACGC TGGAGTTTGC TAGAAACCTC TATCCTGAGT ATAAACGCCA 4560

TGGTTTGGGG CCTTTGACCA AGCGTTTTGG TGTGGCCTTG GAACATCACC ACATGGCCAA 4620

CTACGATGCG GAAGCGACTG GTCGTCTGCT TTTCATCTTT ATCAAAGAGG TAGCAGAAAA 4680 

ACATGGTGTG ACCGATTTAG CTAGACTCAA CATTGATCTA ATCAGTCCAG ATTCTTACAA 4740

AAAAGCTCGG ATCAAGCATG CGACCATCTA TGTCAAGAAT CAGGTAGGTC TAAAAAATAT 4800 

CTTTAAGCTG GTTTCCTTGT CTAATACCAA GTATTTTGAA GGAGTGCCAC GGATTCCGAG 4860 

AACGGTTCTA GATGCCCATC GAGAGGGCTT GATTTTAGGT TCAGCCTGTT CAGAGGGTGA 4920
AGTTTTTGAC GTGGTCGTTT CTCAAGGTGT GGATGCGGCG GTTGAGGTGG CCAAGTATTA 4980

TGATTTTATC GAGGTCATGC CACCGGCTAT CTATGCACCC TTGATTGCCA AAGAGCAGGT 5040 

CAAGGATATG GAGGAACTCC AGACCATTAT CAAGAGTTTG ATAGAGGTTG GAGACCGCCT 5100 

TGGCAAGCCT GTTCTGGCTA CGGGAAATGT TCACTATATC GAACCGGAAG AAGAGATTTA 5160 

TCGTGAAATT ATCGTCCGTA GTTTGGGACA GGGTGCGATG ATTAATCGAA CTATCGGTCA 5220 

TGGTGAACAT GCCCAACCAG CACCACTTCC AAAGGCTCAT TTTCGAACGA CTAATGAGAT 5280

GTTGGATGAA TTTGCCTTTT TGGGAGAGGA ACTGGCTCGT AAACTGGTTA TTGAAAACAC 5340 

CAATGCCTTG GCAGAAATAT TTGAATCCGT TGAAGTCGTT AAGGGTGACT TGTATACGCC 5400

TTTCATCGAC AAGGCTGAAG AAACAGTTGC TGAGTTGACC TATAAGAAAG CTTTTGAGAT 5460

TTATGGAAAT CCGCTGCCAG ATATTGTTGA TTTGCGGATT GAAAAAGAAT TAACATCCAT 5520

ACTGGGGAAT GGATTTGCTG TGATTTATCT GGCATCGCAG ATGCTGGTGC AACGTTCTAA 5580 

TGAACGGGGT TATTTGGTTG GTTCTCGTGG GTCTGTCGGA TCTAGTTTCG TTGCGACCAT 5640

GATTGGGATT ACGGAGGTCA ATCCTCTCTC TCCTCACTAT GTCTGTGGTC AGTGTCAGTA 5700

CAGTGAGTTT ATCACAGATG GTTCGTACGG TTCAGGATTT GATATGCCCC ATAAGGACTG 5760

TCCAAACTGT GGTCACAAAC TCAGTAAAAA CGGACAGGAT ATTCCGTTTG AGACCTTCCT 5820

TGGTTTTGAT GGGGATAAGG TTCCTGATAT TGACTTGAAC TTCTCGGGAG AAGATCAGCC 5880 

TAGCGCCCAC TTGGATGTGC GTGATATCTT TGGTGAAGAA TATGCCTTCC GTGCGGGAAC 5940 

GGTTGGTACG GTAGCTGCCA AGACTGCCTA TGGATTTGTC AAAGGTTACG AGCGAGATTA 6000 

TGGCAAGTTT TATCGTGATG CAGAAGTAGA ACGCCTCGCT CAAGGAGCGG CGGGTGTCAA 6060 

GCGGACAACA GGCCAACACC CGGGGGGAAT CGTTGTTATT CCGAACTACA TGGATGTCTA 6120

CGATTTTACG CCTGTCCAGT ATCCAGCAGA TGATGTCACG GCTGAATGGC AGACCACTCA 6180 

CTTTAACTTC CACGATATCG ATGAGAACGT CCTCAAACTC GATGTACTGG GACATGATGA 6240

TCCGACTATG ATTCGAAAAC TTCAGGATTT GTCTGGTATT GACCCTAATA AAATTCCTAT 6300

GGATGACGAA GGCGTGATGG CACTCTTTTC TGGGACTGAT GTGCTAGGGG TAACACCTGA 6360

ACAAATTGGA ACGCCTACGG GTATGTTGGG GATTCCAGAG TTTGGAACAA ATTTCGTACG 6420

TGGAATGGTA GACGAAACCC ATCCGACAAC CTTTGCGGAA TTGCTTCAGC TGTCTGGTCT 6480

GTCCCACGGT ACTGATGTTT GGTTGGGGAA TGCTCAGGAT CTGATTAAGC AAGGAATAGC 6540 

GGACCTATCG ACTGTTATCG GTTGTCGGGA CGACATCATG GTTTACCTCA TGCATGCGGG 6600

TCTGGAACCT AAGATGGCCT TTACCATTAT GGAACGGGTA CGTAAGGGTT TGTGGCTAAA 6660 

GATTTCAGAA GAGGAGAGAA ATGGCTATAT CGAAGCAATG AAGGCTAATA AGGTGCCAGA 6720
GTGGTATATC GAATCCTGTG GGAAAATTAA GTACATGTTC CCTAAGGCCC ATGCGGCAGC 6780

CTACGTTATG ATGGCCTTGC GTGTAGCTTA CTTCAAGGTT CACCATCCTA TTTATTACTA 6840 

CTGTGCTTAC TTCTCCATTC GTGCTAAGGC TTTTGATATC AAGACCATGG GTGCGGGCTT 6900 

GGAGGTCATC AAGCGCAGAA TGGAAGAAAT CTCTGAAAAA CGGAAGAACA ATGAAGCCTC 6960

TAATGTGGAA ATCGATCTCT ATACAACTCT TGAGATTGTC AATGAGATGT GGGAACGAGG 7020

TTTCAAGTTT GGTAAATTAG ATCTCTACTG TAGTCAGGCG ACAGAGTTCC TCATCGACGG 7080

GGATACCCTT ATCCCACCAT TTGTAGCAAT GGATGGTCTG GGAGAGAACG TTGCCAAGCA 7140 

ACTGGTGCGG GCGCGTGAAG AGGGAGAATT CCTCTCTAAA ACAGAACTAC GCAAGCGTGG 7200

TGGACTCTCA TCAACCTTGG TTGAAAAGAT GGATGAGATG GGTATTCTTG GAAATATGCC 7260

AGAGGATAAC CAGTTGAGTT TGTTTGATGA GTTGTTTTAA AAAATTGCTT AATAATCTAT 7320

TAAAAGAGGC TAACGTATAT CCAATAGATT TACATTAGCT TTCTTTTTTG TTAAAATAGT 7380

CTATGGAAAG AGGGTGAGAG TATGTCAAAG ATGAGTATAA GCATCCGTCT GGATAGTGAG 7440

GTTAAGGAGC AGGCCCAACA GGTGTTTAGT AATCTGGGAA TGGATATGAC AACAGCTATT 7500

AATATTTTCC TTCGTCAGGC AATTCAATAT CAGGGATTAC CTTTTGATGT TAGACTAGAC 7560

GAAAATCGGA AGTTGCTCCA AGCGTTAACG GATTTAGACC AAAATCGTAA TATGAGCCAG 7620

TCTTTTGAAT CAGTCTCAGA TTTGATGGAG GACTTACGTG CTTAAGATTC GTTATCATAA 7680

ACAGTTTAAA AAAGATTTTA AGTTGGCTAT GAAGCGTGGT TTGAAGGCAG AATTATTAGA 7740

AGAAGTTTTG AATTTTCTGG TTCAAGAAAA AGAACATCCT GCCAGAAATC GTGATCATTC 7800 

ATTGACGGCA TCCAAGCATT TTCAAGGAGT TCGTGAATGC CATACCCAGC CAGATTGGCT 7860

TTTGGTTTAT AAAGTAGACA AGTCGGAATT GATTTTAAAT TTGCTGAGGA CAGGCAGTCA 7920

CAGTGATTTA TTTTAATCTA TTTTAAGGGG GTTCTCATGA AACTAAGAAT ATTTGCGGAA 7980

GATAAGCCGG CTAAGAAGGT ATTTGAATAT CAATTAGAAC TTGCTGATCG TACAATTCTT 8040 

CTATCGACAG CACTCTTGTC AGGTGCTATT GCTTTAGCAG GAATCTTTTC TGCTTTGAAA 8100

GAAAAATAAA AATAGAAAAG AGAAAACAGA ATGGTTTTAC CAAATTTTAA AGAAAATCTA 8160 

GAAAAATATG CGAAATTGTT GGTTGCGAAC GGAATTAACG TGCAACCTGG TCACACTTTG 8220

GCTCTCTCTA TTGATGTGGA GCAACGTGAA TTGGCACATC TAATCGTGAA AGAAGCTTAT 8280

GCCTTGGGTG CGCATGAGGT CATCGTTCAG TGGACAGATG ATGTGATTAA CCGTGAGAAA 8340 

TTCCTCCATG CCCCGATGGA GCGTTTGGAC AATGTGCCAG AATACAAGAT TGCTGAGATG 8400 

AACTATCTCT TGGAGAATAA GGCTAGCCGT CTTGGAGTTC GTTCATCTGA TCCAGGTGCC 8460 
TTGAACGGAG TGGACGCTGA CAAGCTTTCA GCTTCTGCTA AAGCTATGGG ACTTRCCATG 8520

AAGCCTATGC GTATCGCAAC TCAATCTAAC AAGGTTAGCT GGACTGTAGC AGCTGCAGCA 8580 

GGACTTGAGT GGGCTAAGAA AGTCTTCCCA AATGCTGCGA GCGACGAAGA AGCAGTTGAT 8640

TTCCTTTGGG ACCAAATTTT CAAAACTTGC CGTGTCTACG AAGCAGATCC TGTTAAGGCT 87 00 

TGGGAGGAAC ATGCAGCCAT TCTCAAGAGC AAGGCCGATA TGCTTAATAA GGAGCAATTT 87 60 

TCAGCCCTTC ACTACACAGC GCCAGGAACA GATTTAACAC TTGGTTTGCC AAAGAACCAC 882 0 

GTTTGGGAAT CAGCTGGTGC TGTCAATGCA CAGGGCGAAG AATTCTTGCC AAATATGCCA 8880 

ACAGAAGAGG TCTTCACAGC GCCTGACTTC CGTCGTGCAG ATGGTTATGT CACTTCTACA 8940

AAACCGCTTA GCTACAACGG AAATATCATT GAAGGCATTA AGGTGACCTT TAAGGATGGA 9000

CAAATCGTAG ATATCACTGC TGAGAAGGGT GATCAGGTTA TGAAAGACCT TGTCTTTGAA 9060 

AATGCGGGTG CGCGTGCCTT GGGTGAATGT GCCTTGGTAC CAGATCCAAG TCCAATTTCT 9120

CAGTCAGGCA TTACCTTCTT TAACACCCTT TTCGATGAAA ATGCGTCAAA CCACTTGGCT 9180 

ATCGGTGCAG CCTATGCGAC TAGCGTTGTT GATGGAGCGG AGATGAGCGA AGAGGAGCTT 9240

GAAGCTGCAG GGCTTAACCG TTCAGATGTT CACGTAGACT TTATGATTGG TTCTAACCAA 9300

ATGGATATCG ATGGTATTCG TGAGGATGGA ACGCGGGTAC CTCTTTTCCG TAATGGGAAT 9360

TGGGCAAATT AAGGAGATAA TATGTTAGGA AGTATGTTCG TTGGTCTCCT AGTGGGATTT 9420

TTAGCAGGTG CTATGACCAA TCGTGGAGAG CGAATGGGAT GTTTTGGAAA AATGTTTCTC 9480

CGTTGGATCG GAGCCTTTCT AGGTCACTTG CTCTTTGGAA CTTGGGGGCC AGTTTTATCA 9540 

GGAACAGCTA TTATCCCAGC GATTTTAGGA GCCATGATTG TTTTAGCTAT TTTTTGGAGA 9600 

CGAGGAA 9607 
(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14231 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:
CTACAAGATA ATTCCAGCTA TAACATCCGC TATAATAGTA AGAGCGAGCT CTATGATAAG 60 

GCTCATTAGT TTCACCTCCT CTCACGAACC CATAGGAACG TAATCGGTAA CCGATGACAA 120

AAATAGTATA CCACAATACA TTTAGATCAT CAAGGTCACT TAATTCTTGA AATATCAGAT 180

CTAAGAGAAA AATCTTTAAA ATCAGAAAAA CGCATAATAT CAGGTGTGCA AAAACTTGAT 240
ACTATGCGTT TTATTGTGGG AAGGTTTACT CCATTTTCTC CTGAAATTGA GTTTTTGTCC 300

AGCCTCTGTT TTTAGGGTTG CTAAGAAAAT AATGTCATGT GGTGAATATT TGTAAATCAG 360 

TCAGCAGACA GAACGATACT CTTCGAAAAT CTCTTCACAT CATGTCAGCT TCGTCTTTCC 420

GTATATATGT GACTGACTTC ATCAGTTCTA TCTACAACCT CAAAACAGTG TTTCGAGCTG 480 

ACTTGATCAA TTTTCAAATC TGTACTTTGA GCAAGCTGAG ACTAGCTTCC TATTTGATTT 540 

TCATTGAATA TCAGAAACCC ATTCTCCATC AAATAATTCG ACTGCGTCTA ATAATTTTTG 600 

ATCTGGCACG GTGTCTGAAA TAAAGGTTGT GTATTTGGAG AGGGGATTAA TTTTAAAAAA 660 

TCCAGTCTTG TAAAATTTAG AACTATCAAT CAGTAAGATG GTTTCATGGG CTTTGTCAAT 720 

AATATTCTTT TTTGAAATAG CTTGGCTGAG AGAAGCTTCA TAAACATATT GGTCATCAAT 780 

ACCTCTTGCT GAACAAAATG CTAAATCGAT ATTAAAATGA TCTAATAAAG AATTTTCCTT 840

ATCATAGTTG ACCACGGAAC AGGATTGATG TTTGACCTCG CCAGATGTGA TAAAGATTTT 900 

GGAGCTATCT TTAACAGTTT CAGATAGGGT TTGTGCAGTA TGTAAACCAT TTGTAAAAAT 960 

AATCAAATTA TCAAGTTCAG AAAGATAGGG ACAGAGTTCG TAGACAGTAG TACTAGAATC 1020 

TAGATAGATA CACATACCAG ACCGAATAAA GTCTTTAGCG AGACTAGCGA TTAGTCTTTT 1080 

TTGCCTAGTA CTTTCTCCTT CACGTATTTG ATGAGAAAGT TCAATTGTGT TCATAGAGGA 1140 

CAGGGTCACG TATCCGTGCT TTCTTTTGAT AAGACCTTGA TTTTCTAAGA AAATTAAATC 1200

ACGACGTAAG GTACTTGTGC TGGAGAAAGT GATTTCTGCC AGCTCTTTTA CGGCAATTCT 1260

TTTTTTCTTT TTGATAATTT CAATCAATTC AAGTACACGT TCATCTTTTA TCATAAGCTC 1320 

CTCCTAATTT ATCATTTCAA CTATATTATA GCACAAATTG GAGGAATTTG AATTATTTTT 1380

ATGAATATTG GGTTAACATT TGAACATTAT TCAAGTAAGC GTTCACATAT TGAAAAAATA 1440 

AAACGTGGGG ATTATAATAA AGTTAATCmA GGACGAAGAG AGAAGAAAAA TGGAAGCGGT 1500 

TTTAGCAATA GATTTAGGTG CGACTTCTGG AAGAGCAATC GTTGGTTACC TTTCTGAAAA 1560

TAAACTAGTA ATGGAAGAAA TAAATCGCTT TTCTAATCTA CCTATTAGAG TAAAAGGGCA 1620

TTTATCTTGG GATATTGACT TTCTACTAGC TAAAATTCTT GAAAGTATCC GCTTGGCTAA 1680 

TACTAGTTAC AAGATTTTAT CTATCGGTAT TGACACATGG GGAGTTGATT TTGGACTGAT 1740

TGATAATGAA GGTAAGCTGT TATTACAACC TGTTCATTAT CGTGATGAAA GAACAAAGGG 1800

AGTGTTAAAG GAAATATCTG AAATGACTGA ATTAGAAAAA CTGTATTCAG AGACAGGAAA 1860

TCAGATTATG GAGATAAATA CCTTGTTTCA ACTCTTTAAG GCACGTCAAG AATCTCCTGA 1920

CTCTTTCTAT AAGACCAATA AGATTCTTTT AATGCCAGAT TTGTTTAATT ATCTCTTGAC 1980 
AGGTAAGTTT GCTACAGAAA AAAGCATTGC TTCAACAACT CAATTATTTG ATCCTAGGAG 2040

TCAAAATTGG AATCAGAATA TCTTAAAACT ATTTGAATTG GATTCATCTT TACTTCCTGA 2100 

AATTGTTTCA GAGGGAAATG TTCTTGGAAG GATAAAAGAG GAGTATGGTT TAGGCGATAT 2160 

TCCTGTTGTG AATGTTTGTA GTCATGATAC AGCAAGCGCG ATTGTCTCAG TACCTAAGAC 2220

AGAAGGTAGT TTATTTATTT CATCAGGTAC TTGGTCTTTG GTTGGAGTGG AACTTACTTC 2280 

ACCGATTCTT ACTACCGAAT CCTTCAGTTA TGGATTTACA AATGAAGTCG GTAAAGATGG 2340

AGTGATTACA TTTCTGAAGA ATTGTACAGG GTTGTGGATC ATAGAGGAAC TAAGACGTTC 2400 

ATTTGAACGA AGAGGGAAAG CCTATTCTTT TGATGATATT AGGACAATGG TGGAGAAAGA 2460

AAAAGAAAAT CTTCCTCTGA TTGATACTGA ATCAACTGAA TTTGCAACAG AATCTGATAT 2520

GCACAAGACT TTGACAGAAT ATCTAGCTTA TCATCATGAA ACTAGAGAGT GGACAGATGG 2580

ACAACTATTT AAGATTGTTT ATGAAAGCCT AGCTGAAACG TATAGGAAAG CGATAGAGTT 2640

ACTAGAAGAA CTAACTCATA AGGTTTATAA GAGGATATAT GTGATTGGAG GAGGTGCTAG 2700

AGCCAGTTAC TTTAACCAAA TGATTGCTGA TAGAACTGGT AAAGAGGTTC TTACAGGTTT 2760

GACTGAGGCT ACAGCTGTGG GGAATATTGT TGTGCAGCTC ATAGCTATGG GACAATTAAA 2820

AGGGATGGAA GAGGCTCACC ATGTTATTGA GGAGTTTCTA CAATTAGAGA GTTATTACTC 2880

CCAAAAGAAT TAAAAAGATT GAGAGTTTGT AAATTTGCCT CCCTCCCCCT TCTTAGCTTT 2940

TGTGCAGGAA GGGGGGATAA TTGGTGAATT GAAAAATATT TAGTGTTTTG ATATGAGGAG 3000

GACAAGGATG TCAGATGTAA AACAAGAATT AATTAAATAT GGTAAGAAGC TAGTAGAAAC 3060 

AGATTTGACG AAAGGAACAG GTGGGAATCT CAGCGTTTTC GATCGTGAAA AACAATTGAT 3120 

GGCAATTACC CCGTCGGGTA TTGATTTCTT TGAAATCAAA GAATCCGATA TTGTAGTGAT 3180 

GGATATTAAT GGAAATGTTG TAGAGGGAGA ACGCTTGCCA TCTAGCGAAT GGTATATGCA 3240

TTTGATTCAA TATCAAACTC GTGATGATAT CGATGCAATT ATCCATGCTC ATACAACTTA 3300

TGCAACAGTA TTAGCTTGTC TCAGAGAACC ACTTCCAGCG AGTCATTATA TGATTGCAGT 3360

GGCAGGGAAA GATGTTCGGG TAGCTGAGTA TGCAACATAT GGCACGAAAG AATTGGCTGT 3420

GAATGCAGCT AAAGCAATGG AAGGTCGTAG AGCAGTTTTA CTAGCGAATC ATGGAATTTT 3480

AGCAGGTGCA CAAAATTTAT TGAATGCATT TAATATTGTT GAAGAAGTTG AATATTGTGC 3540

AAAAATTTAT TGTTTAGCTA AGAATTTTGG AGAGCCAGTA GTTCTTCCTG ATGAGGAGAT 3600

GGAATTGATG GCAGAAAAAT TTAAAACATA CGGTCAGAGA AAATAGGGAG GATATTAATG 3660 

TTAAAACATA TACCGAAAAA TATTTCTCCA GATTTATTGA AGACTTTAAT GGAAATGGGA 3720 

CATGGAGATG AAATAGTATT AGCTGACGCG AATTATCCTT CTGCCTCATG TGCAAATAAG 3780
CTAATTCGTT GTGATGGTGT AAATATTCCA GAATTATTAG ATTCCATTCT GTATTTAATG 3840

CCATTAGATA GTTACGTCGA TAGTTCAATT CAGTTTATGA ACGTTGTTTC GGGTGATGAT 3900

ATTCCTAAGA TATGGGGTAC CTATAGACAG ATGATTGAAG GTCATGGTAC AGATCTTAAA 3960 

ACGATTACTT ATCTTAGAAG AGAAGACTTT TATGAACGTA GTAAGAAAGC TTATGCTATT 4020

GTTGCTACAG GAGAAACTTC ACTTTATGCT AATATTATCC TTAAGAAAGG AGTAGTTGTT 4080 

GAAAGAGAAA ATGTTCAATA GAGGAATTTT AGTTGCCAGT CATGGTAATT TTGCTAGCGG 4140

AGCTCTCATG ACCGCAGAAA TGTTTGTTGG TGAGACAACA AATGATAGAG TTAGGACATT 4200 

AGGTTTGATG CCTGGAGAGA ATATTGTAGA GTTTGAGCAT TATTTTAAAA ATCAAGTGGA 4260

TGAACTGTTA GACTCAAATC AAGAGGTTAT CGTTTTGACT GACTTGATTG GAGGAAGTCC 4320

TAATAATGTG GCTTTGTCAC GGTTTTTAAA TTTGGATTCA GTTGATATTG TAACAGGGTT 4380

TAATATCCCT CTCCTAGTGG AATTAATATC AAGTTATGAT TCAAAAATCA ATTTAGAAGA 4440

AATTGTTCAC AATGCTCAAA ATAGTTTGTT TAATGTTAAA CAACAACTTA ACGTAGAGGA 4500

GGAAGAAGAT TTATGTCTAT AGAGTTTGTT CGTATTGATG ACCGTCTGGT ACATGGTCAA 4560

GTTGTCACTA CGTGGCTAAA AAAGTATGAT ATTGAGCAAG TTATCATTGT TAATGATCGC 4620

ATCTCAGAAG ATAAAACACG ACAATCTATT TTAAAGATTT CTGCACCGGT AGGTTTAAAA 4680

ATTGTTTTCT TTAGTGTAAA ACGGTTTGTG GAAGTTTTAA ACTCTGTGCC AATAAAAAAG 4740

AGAACAATGC TGATATATAC AAATCCAAAA GATGTGTATG ATTCTATTGA AGGAAATTTA 4800 

AAATTGGAGT ACCTCAATGT AGGACAGATG AGTAAAACGG AGGAAAATGA AAAGGTAACG 4860

GGAGGTGTAG CTCTAGGTGA AGAAGACAAA TATTATTTTA AGAAAATAGT TGATAAGGGA 4920

ACGAGAGTTG AAATTCAAAT GGTTCCTAAT GATAAAGTTA CAATGTTGGA AAAATTTTTA 4980 

TAAAAATAAT TTAAGGAGGT ACAGTATATG CTATTCACAC AAGCATTACT GGTGACATTA 5040

GTTGGGATTA TTGCCACTAT TGACTATAAT GGACCGTTAT TTATGATTCA CCGTCCGTTA 5100 

GTTACAAGTG CAATGGTTGG CTTAGTATTA GGAGATTTCA CCCAAGGTGT TCTTATTGGT 5160

TCAGCTCTTG AATTAACTTG GCTCGGTGTA ACAGGTATTG GAGGTTATAC TCCACCAGAT 5220 

ACTATTTCAG GTGCGATTAT TGGTACTGCA TTTGGTATTT TATCTGGTCA AGGAGAAACT 5280 

GCTGGTATCG CTATAGCAGT TCCAATTGCA GTTGCTACCC AACAGTTGGA TGTTCTTGCA 5340 

AAAACTTTAG ATGTTTATTT TGTGAAAAAA GCTGATAATG ATGCTAAAAA CGGAGATTAT 5400 

TCAAAGATCG GTTTTTATCA TTATTCAAGT TTGGTTTTAA TCACGTTATT TAAAATTGTA 5460 

CCAATTTTCC TAGCTATTAT GCTTGGAGGG GAATATG7GG CAGACTTGTT TGCTAAGGTT 5520 
CCACCAATCG TTATGCAGGG ACTTAACTCT GCAGGTGCTT TACTACCTTC AATTGGTTTT 5580

GGTATGCTTT TAAATATGAT GCTCAAGAAA AATATGTGGG TATTCTTGTT GATTGGATTC 5640 

ATTTGTTCTG TGTATGGAGG AATGTCAACC ATTGGGATCT CACTAGTTGG TATTGCGGTA 5700 

GCATACTTCT ACGATATGAT TGGAAGCAAA CCACAAGAAA CAACTTCAAG TAGTGATGTT 5760

GAGGAGGATC TTGATCTATG ATGAATAATA AAGTAACTAA AGTTGAACTT AAAAAAGTTT 5820

TCAAACGAAG TTTTATGTAT GGTTCTTCAT GGAACTATGA GAGAATGCAG AACCTAGGTT 5880

TTCTATATAC AATTCTTCCA GTATTGAAAA AACTATACCC AGACAAAGAT TCAGCTTCTC 5940 

CTGCAATGAA ACGTCACCTT GAGTTTTTCA ATACTCATCA AACAGCGGCA CCATTTATTC 6000 

TTGGAGTTAC TTCCGCTATG GAAGAACAAG AAGGAAATGA AGGTGCAGCT TCAATTACTG 6060 

GTATTAAAGT TGGCTTGATG GGGCCACTGG CTGGTCTAGG AGATAGTTTG TTCTGGCTGA 6120 

CACTAGTTCC TATCTGTTTT AGTATTGGTG CGTCTTATTC TAAAGACGGC GGTGCTTTAG 6180

GTATCTTTAT CGCCTTAATA TTGTTTAATA TTATTAATAT TCCTGTTAAA TATTTCGGTT 6240

TGAAATATGG GTATACTAAG GGTTCTAGTC TTATCCAAGA AAATAATACA AAAGGAACAT 6300 

TGAATCGCGT TACGAGTATG GCGACAGCAT TAGGGCTAGT ACTAGTGGGT GGTTTGATTC 6360

CATCAATGGT TGGTATTAAT TTTGGATTAG AATTTAAGCA GGGGGAACTT GTTATTTCTG 6420

TTCAAGAAAT GATTACAAAA TTAATTCCAG GATTTATCCC TATGGCTTTG ACTTTATTAA 6480

TGTGTAAATT AATTAGAAAA GGAAAGAATC CGGTTGTACT AATCTTTAGT GTTATGGCTA 6540 

TTGGAGTTAT TCTAGTTGTT TTAGGAATTT TGAAGTAGTA GAAAGTGTGG AGGTGGTATT 6600 

TGGGATATCA CCTCCATTTT GGAAGAGAGG TAAAGAGTGA AATTATGGTA TAAGAAAGCT 6660

GCCGCAAATT GGAATGAAGC CTTGCCGATT GGGAACGGTC ATTTAGGTGG TATGATTTAT 6720

GGTTCAGCTA CAAAAGAATG TATTCAACTA AACGATGAGA CTATTTGGTA TAGAGGAAAG 6780 

TCAGATAGAA ATAATCCAGA CTCACTATTG CATCTTAAAA AAATTCGGGA ATATCTTTTA 6840 

GATGGAGAAA TTCAGAAAGC CGAAGAATTG ATAAAGTTAA CAGTGTTTGC TACCCCAAGA 6900 

GATCAAAGCC ACTATGAATT ACTTGGGGAA CTTTACATTG AGCATATAGA TATTCAGTCT 6960

TGTGCTCTTT CATTGTATGA AAGAGAGCTA GATTTAGATA CAGCTATTTC TAATGTTGTG 7020 

TTTGAGCCTA ATAGTTGTAA TTTACAAATA AAAAGAGAAT ATTTTACGAG TTTTAATAAG 7080 

AATATTTTAT GTTGCCGTAT AGTGTCATCA GTTCAAAACA CATTAAATTT AAACATTAAT 7140 

TTGGGTAGAA ATAAACGGTT TAATGACGAA GTATCTAAAC TGGATTCAAG TACAATTTTA 7200 

ATGTCGGCCT CTGCTGGAGG TAGAAAAGGT GTTCAGTTTA AAGTAGTATG TCATTCTAAG 7260 

GTTACGGATG GTGAAGTAAG TGTATTGGGA GAGACAATAG TTATTCGGAA TGCTACAGAG 7320
GTATTTCTTT ATCTCAAATC AATGACGGAT TATTGGGGAA ATATAGATAT TTCTTCTCTT 7380

CAGGGAGAAT TTAGTAGTAT TGATTACTTT ACAGAAAAAG ATGAACATGT AAAAAAATAT 7440 

CAGGAGCAAT TTAATAGAGT TGATTTTAAA CTAGACTATA GTAAAGGTTG TCTTAGCATT 7500

CCAACGAATC TACTTCTTGA AAACACTAAA AAGTATAGTA ACTACTTGAC TAACTTGTTA 7560

TTTCATTATG GAAGATATCT GTTAATATCG TCTAGTCAAC CGAATGGTTT ACCTGCCAAT 7620

CTTCAAGGAA TATGGTGTGA TGAATTAAAT CCAATTTGGG GTTCTAAATA TACGATTAAT 7680

ATTAATACTC AAATGAATTA TTGGATGGTA GGTCCATGTG ATTTACCAGA AGTAGAATAT 7740 

CCATTATTTG ATATGCTCGA AAGAATGAGA GAACCGGGAA GACTAACCGC TAAGAAAATG 7800

TATGGAGCTA GAGGTTTTAC AGCACATCAT AATACGGATG GTTTTGGCGA TACGGCTCCC 7860

CAATCTCATG CCATGGGGGC TGCAATTTGG GTATTAACTA TTCCATGGTT ATGTACTCAT 7920

ATTTGGGAAC ACTATTTATA TTTCCAAGAT GAGCGTATTC TTACGGAACA TTTTGAAATG 7980

ATAAAAGAAG CATTTCTTTT CTTTGAAGAT TATTTATTTG AGGTGGATGG CTACTTGATG 8040

ACAGGTCCAA GTGTCTCACC GGAAAATAAA TATCGCTTAA AAAATGGTAT TGAAGGAAAT 8100

GCTTGTCTAT CATCTACAAT TGATAATCAA ATTCTAAGAT ATTTTTGTGA TTCATGCATT 8160 

GGCATTGCAA AACAATTAGG AGACAATTCG GATTTTATTA GTCGTGTGAA GGAGTTAAAA 8220

AAGAAACTAC CTAAAACAAA AATAGGTAGT AATGGGCAAA TCCAAGAATG GTTAGAAGAT 8280

TATGAAGAAG TAGAGCCTGG GCATAGACAC ATTTCACCTC TATTTGGGCT TTATCCTTAT 8340

AATGAGATTG ATATTCATAA AACTCCGGAA TTAGCAGAAG CAGCTAAAAT CACTATCAAT 8400

AGGAGATTAT CAAACGCTAA TTTTTTATCT TCACAGGAGA GGGAGCAAGC GATTAATAAT 8460

TGGTTAGTAA GTGGTTTGCA TGCTAGTACA CAAACAGGTT GGAGTGCTGC ATGGCTGATT 8520

CATTTTTTTG CGAGACTATA TCAAGGTGAA CCTGCTTATA ACCAGATTAA TGGTTTGTTA 8580

AATAATGCGA CTCTTGGCAA TTTATTTCTT GACCATCCAC CATTTCAAAT TGATGGTAAT 8640

TTAGGTTTGG TGAGTGGAAT TTGTGAATTA TTAGTACAGA GCCATCATAA TTGGTTATCA 8700 

CTAATTCCAG CTTTACCTTC TGCTTGGTCA GAAGGAGAAG TGAAAGGTTT CAGAGTAAGA 8760 

GGAGGATATA AGGTATCGTT TGCTTGGAAA AATGGGGATA TAACATTCCT AAAATTGGAA 8820

GGAGGAAACA AAGATCAAAA AGTAAGAGTA AGAATATATG GCAAAAATAC TGATGTACAA 8880

AATATTGAAT TGGTATTTAA TTCAGAAAAA ATTATTGAGT TAAATTTTTA GGTATAAGTC 8940 

ATGAATAAAG AAAAAATAAA AAGAAAATTA ATCACAATAT TGTTTGTATG TATTGGGATG 9000

TTATGTTTTG GATTGTTAGC AGGAGTTAAG GCTGATAATC GTGTTCAAAT GAGAACGACG 9060 
ATTAATAATG AATCGCCATT GTTGCTTTCT CCGTTGTATG GCAATGATAA TGGTAACGGA 9120

TTATGGTGGG GGAACACATT GAAGGGAGCA TGGGAAGCTA TTCCTGAAGA TGTAAAGCCA 9180 

TATGCAGCGA TTGAACTTCA TCCTGCAAAA GTCTGTAAAC CAACAAGTTG TATTCCACGA 9240

GATACGAAAG AATTGAGAGA ATGGTATGTC AAGATGTTGG AGGAAGCTCA AAGTCTAAAC 9300

ATTCCAGTTT TCTTGGTTAT TATGTCGGCT GGAGAGCGTA ATACAGTTCC TCCAGAGTGG 9360

TTAGATGAAC AATTCCAAAA GTATAGTGTG TTAAAAGGTG TTTTAAATAT TGAGAATTAT 9420 

TGGATTTACA ATAACCAGTT AGCTCCGCAT AGTGCTAAAT ATTTGGAAGT TTGTGCCAAA 9480

TATGGAGCGC ATTTTATCTG GCATGATCAT GAAAAATGGT TCTGGGAAAC TATTATGAAT 9540

GATCCGACAT TCTTTGAAGC GAGTCAAAAA TATCATAAAA ATTTGGTGTT GGCAACTAAA 9600 

AATACGCCAA TAAGAGATGA TGCGGGTACA GATTCTATCG TTAGTGGATT TTGGTTGAGT 9660 

GGCTTATGTG ATAACTGGGG CTCATCAACA GATACATGGA AATGGTGGGA AAAACATTAT 9720

ACAAACACAT TTGAAACTGG AAGAGCTAGG GATATGAGAT CCTATGCATC GGAACCAGAA 9780 

TCAATGATTG CTATGGAAAT GATGAATGTA TATACTGGGG GAGGCACAGT TTATAATTTC 9840

GAATGTGCCG CGTATACATT TATGACAAAT GATGTACCAA CTCCAGCATT TACTAAAGGT 9900 

ATTATTCCTT TCTTTAGACA TGCTATACAA AATCCAGCTC CAAGTAAGGA AGAAGTTGTA 9960

AATAGAACAA AAGCTGTATT TTGGAATGGA GAAGGTAGGA TTAGTTCATT AAACGGATTT 10020

TATCAAGGAC TTTATTCGAA TGATGAAACA ATGCCTTTAT ATAATAATGG GAGATATCAT 10080 

ATTCTTCCTG TAATACATGA GAAAATTGAT AAGGAAAAGA TTTCATCTAT ATTCCCTAAT 10140 

GCAAAAATTT TGACTAAAAA TAGTGAGGAA TTGTCTAGTA AAGTCAACTA TTTAAACTCG 10200 

CTTTATCCAA AACTTTATGA AGGAGATGGG TATGCTCAGC GTGTAGGTAA TTCCTGGTAT 10260

ATTTATAATA GTAATGCTAA TATCAATAAA AATCAGCAAG TAATGTTGCC TATGTATACT 10320

AATAATACAA AGTCGTTATC GTTAGATTTG ACGCCACATA CTTACGCTGT TGTTAAAGAA 10380

AATCCAAATA ATTTACATAT TTTATTGAAT AATTACAGGA CAGATAAGAC AGCTATGTGG 10440 

GCATTATCAG GAAATTTTGA TGCATCAAAA AGTTGGAAGA AAGAAGAATT AGAGTTAGCG 10500 

AACTGGATAA GCAAAAATTA TTCCATCAAT CCTGTAGATA ATGACTTTAG GACAACAACA 10560 

CTTACATTAA AAGGGCATAC TGGTCATAAA CCTCAGATAA ATATAAGTGG CGATAAAAAT 10620

CATTATACTT ATACAGAAAA TTGGGATGAG AATACCCATG TTTATACCAT TACGGTTAAT 10680

CATAATGGAA TGGTAGAGAT GTCTATAAAT ACTGAGGGGA CAGGTCCAGT CTCTTTCCCA 10740 

ACACCAGATA AATTTAATGA TGGTAATTTG AATATAGCAT ATGCAAAACC AACAACACAA 10800 

AGTTCTGTAG ATTACAATGG AGACCCTAAT AGAGCTGTGG ATGGTAACAG AAATGGTAAT 10860 
TTTAACTCTG GTTCGGTAAC ACACACTAGG GCAGATAATC CCTCTTGGTG GGAAGTCGAT 10920

TTGAAAAAAA TGGATAAAGT TGGGCTTGTT AAAATTTATA ATCGCACAGA TGCTGAGACT 10980 

CAACGTCTAT CTAATTTTGA TGTGATTCTA TATGACAATA ATAGAAACGA AGTTGCTAAG 11040

AAACATGTTA ATAATTTGTC GGGTGAATCT GTTAGTCTAG ATTTCAAAGA AAAAGGAGCA 11100 

AGGTATATTA AAGTTAAATT ACTAACGAGT GGAGTGCCTT TGAGTTTAGC AGAAGTAGAG 11160 

GTTTTTAGAG AATCAGATGG TAAGCAATCT GAAGAGGATA TAGATAAAAT AACAGAAGAT 11220

AAAGTAGTCT CTACAAATAA GGTAGCTACT CAAAGTTCAA CCAATTATGA GGGTGTAGCT 11280

GCTTTAGCAG TTGATGGTAA TAAAGATGGA GATTACGGAC ATCATTCGGT GACTCATACT 11340

AAGGCAGATT CTAACGCTTG GTGGCAGGTC GATCTGGGAG AAGAGTTTAC GGTTTCTAAA 11400 

GTTGATATTT ATAATAGAAC AGATGCCGAA CCTCAGCGTT TATCTAATTT TGATGTTATT 11460 

TTTCTATCTT CATCAGGAGA AGAAGTTTTT AGAAGACATT TTGATAAAGT AGTTGATGGT 11520 

TTGTTATCTT TAAAAGTACC TTCTGTAGGG GCTAAGCTAG TCAAAATAGA ATTAAAATCA 11580 

GCAGCTATTC CGTTAAGTTT AGCGGAAGTT GAAGTCTATG GTTCAAAGAG AACTCCGAAG 11640 

AAACTTTCTA ATATTGCATT AACAAAAGAA ACTCGACAGA GTTCAACGGA TTACAATGGT 11700 

TTTTCTCGTC TAGCAGTTGA TGGAAATAAA AACGGAGATT ATGGTCATCA TTCAGTGACT 11760 

CATACCAAAG AAGATTCTCC TTCATGGTGG GAGATAGATT TAGCACAAAC CGAAGAATTA 11820 

GAAAAGTTAA TTATTTATAA TAGAACAGAT GCTGAAATTC AGAGATTATC AAATTTTGAT 11880 

ATTATTATAT ATGATTCAAA TGATTATGAA GTTTTTACAC AACATATTGA CAGTTTAGAA 11940 

AGCAATAATC TATCCATAGA CTTAAAAGGA CTGAAGGGAA AAAAGGTTAG AATTTCTTTG 12000

AGAAGCGCAG GAATTCCTTT AAGTTTAGCA GAGGTAGAGG TTTATACTTA TAAGTAATTT 12060

TAAAAATTAT CACCCAGGCT ACCGTAAATA TAATGGAGAT GGTAGTATGA AAGAAACAGA 12120

AAAATAAGAG GAAAATAGTA TGATTCAACA TCCACGTATT GGGATTCGTC CGACTATTGA 12180 

TGGTCGTCGT CAAGGTGTAC GCGAATCACT TGAAGTGCAA ACAATGAACA TGGCTAAAAG 12240 

TGTGGCAGAT TTGATTTCAA GCACATTGAA ATATCCAGAT GGGGAACCTG TGGAATGCGT 12300

GATTTCTCCA TCTACTATTG GCCGTGTACC AGAGGCTGCA GCTTCCCATG AGTTGTTTAA 12360

AAAATCAAAT GTTTGCGCAA CAATTACAGT TACACCATGC TGGTGTTATG GTAGTGAAAC 12420 

TATGGATATG TCTCCAGATA TTCCTCATGC TATTTGGGGA TTTAATGGGA CAGAACGCCC 12480 

AGGAGCTGTC TATCTTGCAG CTGTACTAGC TTCACATGCT CAAAAAGGGA TTCCAGCCTT 12540 

TGGGATTTAT GGAAGAGATG TTCAGGAAGC TAGTGACACA GATATTCCAG AAGATGTCAA 12600
AGAAAAACTT TTACGCTATG CGCGTGCAGC TCTTGCAACT GGCTTGATGA GAGACACTGC 12660

TTACCTATCA ATGGGTAGTG TTTCGATGGG GATTGGTGGT TCTATTGTAA ATCCGGATTT 12720

CTTCCAAGAA TACTTAGGAA TGCGAAATGA ATCGGTAGAT ATGACGGAGT TCACGCGCCG 12780

TATGGACCGT GGTATTTACG ACCCTGAAGA GTTCGAACGT GCGCTCAAAT GGGTGAAAGA 12840

AAACGTAAAA GAAGGATTCG ACCATAACCG TGAAGACCTT GTTTTAAGCC GTGAAGAAAA 12900

AGATAGACAA TGGGAATTTG TTATTAAGAT GTTCATGATT GGACGTGACT TAATGGTTGG 12960

TAACCCAAGA CTTGCTGAAC TTGGTTTTGA GGAAGAAGCG GTTGGTCACC ATGCTTTAGT 13020

AGCTGGTTTC CAAGGTCAAC GTCAGTGGAC AGACCATTTT CCAAATGGGG ACTTTATGGA 13080 

AACTTTCCTC AATACTCAGT TTGACTGGAA TGGTATTCGA AAACCATTTG TATTTGCGAC 13140 

AGAGAATGAT TCACTAAATG GTGTGTCTAT GCTCTTTAAT TATCTATTAA CAAATACTCC 13200

ACAAATCTTT GCTGATGTGC GTACTTATTG GAGCCCAGAG GCTGTTAAAC GTGTAACGGG 13260

ACATACTTTA GAGGGTCGTG CTGCAGCTGG CTTCTTACAT CTAATCAACT CTGGTTCTTG 13320 

TACATTGGAT GGTACAGGTC AAGCTACTCG AGATGGCAAA CCTATTATGA AACCATTCTG 13380

GGAGTTGGAA GAAAGTGAAG TGCAGGCTAT GCTTGAAAAT ACAGACTTCC CACCAGCAAA 13440 

CCGCGAATAC TTCCGTGGAG GAGGATTCTC AACTCGTTTC TTGACGAAGG GGGATATGCC 13500 

AGTAACAATG GTACGTCTCA ATCTTCTAAA AGGGGTTGGT CCAGTGCTAC AAATTGCAGA 13560 

AGGTTACACA CTTGAACTTC CTGAAGATGT TCACCATACT TTAGATAATC GTACAGATCC 13620

AGGATGGCCA ACTACTTGGT TTGCTCCACG TTTGACAGGA AAAGGTGCTT TCAAGTCTGT 13680

CTATGACGTC ATGAATAATT GGGGAGCTAA TCACGGAGCC ATAACATATG GACACATTGG 13740

AGCAGACTTG ATTACCTTGG CTTCTATGTT GAGAATTCCT GTCAATATGC ATAATGTACC 13800 

TGAGGAAGAT ATCTTTAGAC CTAAAAATTG GTCCTTATTT GGAACAGAAG ATCTAGAATC 13860 

AGCAGACTAT CGTGCATGTC AGTTGTTGGG GCCACTACAT AAATAAAACT TGTTTATATA 13920

GGAGGTGAAC TTACGTCCCT CCTATCCTTT TAAAAAGATT TGTTAAACAA TTCACAAATA 13980 

ATTGAAAACG AATACAAAAA GTAATATAAT GATGTTAAAT AGATAGCGCG GAGGCGCAGG 14040 

AGGAAAATTA TATGGCTATA TTTTATGTTC CGGCAGTCAA CCTTATTGGA AAAGGTGTTG 14100 

TAAATGAAGT GGGTCCTTAT ATCAAGGAAC TTGGCTATAA AAAGGCACTT TTGGTGACAG 14160 

ATAAGTACAT CGAAGGCAGT GATATTTTAC CTAAGACTTT AAAACCACTG GATACAGAAG 14220

GAATCGAATA T 14231 
(2) INFORMATION FOR SEQ ID NO: 82: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16995 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

AGTTCTCTTA ACTTTTTTAG GATGGCATTC TCCGCTCTCA GGTACTCATT TTCTGCTgAA 60 

GACGTTCTAA TTCTGTCCTC TCTTCAGGTC TCGTTTTTGG CTTACGTCCC ATTTTAGGTA 120 

CTCTCCCTCT TGTTTTCTCA ACAATAGTAT ACCCGTTTTT CCTGTATTGT GCTAGCCAGT 180 

TAAGAAGTAT CGTACGACTT GGGAGACCGT ATTCAAGAGA AACTCTATCT TTAGTCCAGC 240 

CTTCATGTCA GACTTTATTA CTCATTTCTT GTTTTAAATC AGGAGAATAG TAACGATTTT 300 

TTCCTTTTTT GACGAACTCT ATTCCGTAAC GATCAATCAA TTTAATCATG TACCTAATAT 360 

TAGAATTGCT TATCCCAAAT TTATTTGAAA GCTTCTCTAA GCTATATCCT TGTTTTCTAA 420 

GTTCATAGAT CTGAACTTTA TCATCATAAG TTAGTTTCAT AATAAAAACA CCCCAAAACT 480 

TAGATTTTTT CTGTCTAACT TTTGGGGTGT AGTTCATGTA CACCTGATAT GATGCGTTTT 540 

ATAATTTTTA AGCCTTTTTG CCCAGCCTCG TCAAAAGTAA TGTTTTGACA CAAAATCTGT 600

GACAAAACTT TAGTTTTAAA GGTTTTTAAC TTTGTATATA CTAGTTTTAA GAAAAGGAGG 660 

ATGATCTAAT GGAAGAAAAA GTATCATTGA AAGTCAGGGT TCAAAAACTA GGGACATCGC 720 

TTTCAAATAT GGTTATGCCC AATATTGGAG CATTTATTGC TTGGGGAGTA TTGACTGCCC 780 

TCTTTATCGC TGATGGCTAT CTGCCAAATG AACAGTTAGC TACTGTTGTT GGTCCTATGT 840 

TAACGTATTT ATTGCCAATC CTGATTGGTT ACACAGGTGG ATATATGATC CATGGCCAAC 900 

GTGGTGCCGT TGTAGGAGCT ATTGCTACTG TTGGTGCAAT CACAGGTTCT AGTGTTCCTA 960 

TGTTTATCGG AGCTATGGTA ATGGGCCCAC TGGGAGGATG GACTATCAAG AAATTTGATG 1020 

AGAAGTTCCA GGAAAAAATT CGTCCCGGAT TTGAAATGTT AGTTAATAAC TTCTCAGCTG 1080 

GTCTCGTTGG TTTTGCATTA TTGCTTTTGG CTTTCTACGC AATCGGTCCA GTCGTATCGA 1140 

CTCTTACTGG AGCTGTTGGG AATGGTGTTG AGGCTATTGT CAATGCTCGC CTCCTTCCTA 1200

TGGCTAATAT TATCATCGAA CCGGCTAAAG TCCTTTTCCT CAATAATGCC CTCAATCATG 1260

GCATTTTTAC TCCTCTGGGA GTAGAACAGG TAGCTCAAGC TGGTAAGTCA ATTCTCTTCC 1320

TATTGGAAGC TAATCCTGGA CCAGGTCTGG GAATTCTATT AGCTTATGCT GTATTCGGTA 1380 

AAGGTTCTGC TAAATCTTCT TCTTGGGGGG CAATGGTTAT TCATTTCTTC GGAGGGATTC 1440 

ATGAAATTTA CTTTCCTTAT GTTATGATGA AGCCTACTCT ATTTTTAGCT GCTATGGCAG 1500
GAGGTATCTC TGGAACTTTT ACTTTTCAAC TCTTAGACGC TGGTCTTAAA TCTCCAGCTT 1560

CACCAGGTTC TATTATTGCG ATTATAGCTA CGGCGCCAAA AGGTGTTTGG CCCCATCTAA 1620 

ATGTTCTTTT AGGTGTTTTA GTGGCAGCAG TTGTTTCTTT CCTTGTAGCA GCCCTTATTC 1680 

TTCATGCAGA CAAGTCAACT GAGGATTCGC TCGAAGCTGC TCAGGCGGCT ACCCAAGCAG 1740 

CTAAGGCTCA GTCTAAAGGT CAGTTAGTAT CAACTTCTGT TGATGCAGTT GTTTCGACAG 1800 

ACTCAGTGGA AAAAATCATT TTCGCCTGCG ATGCTGGTAT GGGAAGCTCT GCTATGGGAG 1860 

CTAGTATTCT TCGAGATAAG GTTAAAAAAG CAGGTCTAGA GATTCCAGTA TCTAATCAGG 1920 

CAATCTCAAA TTTGCTTGAT ACACCAAAAA CATTAATTGT TACTCAGGAA GAACTGACAC 1980

CAAGAGCTAA AGACAAGAGT CCAAGTGCTA TTCATGTTTC TGTTGATAAT TTCTTAGCGT 2040

CCTCTCGTTA TGATGAAATT GTAGCTTCAT TAACAGGAGC TTCTCCAATA GCAGAAATTG 2100 

AAGGAGATAT ACCAACTTCA GCACCAGTAG ATAGTCAGGA AAGTGACCTT AACCATATTG 2160 

ATGCTGTAGT AGTTGCTTAT GGTAAAGCAC AGGGAACTGC AACTATGGGC TGTGAAACGA 2220 

TTCGGGCTAT TTTTAGAAAC AAGAATATTC GTATTCCAGT TTCTACTGCC AAAATTTCAG 2280

AATTAGGTGA ATTTAATTCT AAAAACATAA TGATTGTAAC AACTATTTCT TTACAGGCAG 2340

AAGTGCAGCA AGCAGCACCG AATTCTCAAT TTCTTATTGT GGATAGTTTA GTAACAACAC 2400 

CAGAATATGA CAAAATGGCT GCTAGAATGT ACAAATAGAA CTAGAGGTTT CTAAATTACG 2460 

AATGCTATTA ACCAAACGAG AAGAACAATT ATTGAAGGCT TTCCTACATG TAGGGAAGCT 2520 

TTCAATGCAA GATATGACTG AAATCTTACA GGTTTCATCT AGAACAATTT ATCGAACTTT 2580

ATCAGATTTG ACAGATAGCA TGGAGCAATA TGGAATCGAA ATAACGAAGC ATGGGAAATA 2640

CTATATTTTG ACTGGAGAGT TGGATGATTT GCCGACAGAA CTTGAAGTGT TAGTTGAGTA 2700

TAGTCCCCAA GAAAGACAAG AGTTGATTAC CTATCGCCTT CTGACTGAGA GTGGTTTTGT 2760

CACCAATGAA GCATTGCAAG AGTGCACGAA AGTCAGTAAT GTAACTATTA TTCAGGATAT 2820

TTCAGATATT GATAAGCGTC TTTTAGACTT TGATCTGAAA ATTGAACGAC AAAAAGGTTA 2880

TCGGATTTCT GGTGATTCAG TTGGTAAGAG AAGATTTTTG GCTATTTTAC TGACAAACTG 2940

TATCTCAGTA GCAGATTTTT CAACCGGTAA TTTTGGGAGC TTTGATATTT TAGAAGCAGA 3000 

TAGAACTGGG CTGGCCAGTC AGATTGTTAA TAAGCAACTG TCAGGTTTTC CAGATATGGA 3060 

TGCTAGGATG AAGATGTTTT TTGCGATCTT GTTATCTCTT ATAGGTCAGG AGCAAAACAT 3120

TGAAAATTCA CCTAATACTA GTAAGCAGGC TTTGGAAATT TCTCAAAAAA TTTTTCAAGC 3180 

TTACTCTAAG CAGACTGCAC AATTTTATAG TATTCAGGAA ATTATCTATT TTGCGAGCAT 3240 

CTTGGATGAA TTAATCATTA AACGTCAGGA CAATCCGCTC TTTACGGAGA AATTTGATGG 3300
TGAATTTTTC TACAATATTT CAAATCTGAT TGATACGGTT TCCATGTATA CCAAGATTGA 3360

CTTTTTTAAG GACAAGGTTT TATTCAATTT TCTTTTCCAT CATATTCGGC TCAGTTTAGG 3420 

CGTCCCTATC CTTTTTCAGG GTGAAAATTT GCCAGAATCT ATCCAGATTT TAGTTGAAAG 3480

GAATAAATTT CTTTATACAG TCATCAGTCT TTTAGTGAAT GATATTTTTC CGAAATATCT 3540

TCATACAGAG TATGAGTATG GCATGATTGC CCTACATTTT ATCTCTAGCT TAGGCCGTAG 3600

TCCAGAGATT TATCCAGTCC GTGTTTTGCT TTTAACGGAT GAACGTCGGG TCACTAGAGA 3660

TTTATTAGTC AGTAAAATTA AGAGTGTTGC TCCTTTTGTA GAGTTGATAG ATATTCAATC 3720 

TCTAGTAGAT TACCACAGTA TTGATCTCAG TCAGTATGAT TATATTTTAT CTACCAAGCC 3780

GCTGACTAAT CAGGAAATCG ATGTAATTTC TAGTTTTCCA ACCGTCAAAG AATTGCTTGA 3840

ATTACAGGAA CGACTTCAGT ATGTACAGGC ACATCGTACA ATTGTCGCGC GTGATGCTAT 3900

CGCTCCAGAG AAAAGTTATG ACTTGCAAGA TTATTTAATA TCTAGTAGTC AGCTTTTGAG 3960 

TCAATTCGAG TTGGTTCAAT TGGAGAATAA TCAATCATTT GAGCACACGG TAGAACAAAT 4020 

CATCCAATAT CAGAAGAATG TGAGTGACAG AGCTTACCTA ACAAGAAAAT TGTTATCTCA 4080

CTTCCAGAAT AGTCCTATGG CTATTCCTAA TACTGGTCTG GTGCTTTTAC ATAGTCAGTC 4140 

TAGCAAAGTA ACAACAAATA GTTTTACTAT GTTTGAACTC AAACTACCTA TCTCCGCATT 4200

GTCAATGAAA CGAGAGGAAG AAGAGGTCAA AAGGTGTCTG CTAATGCTAA TGTCTAAAGA 4260

AGCTAGCGAG GAAGCTAGAG ATTTAATGAC AGCTATTAGT CAGTCGATTA TTGAAAATCA 4320 

TCTTTATACA GAGATTTACA AGACGGGAAA TCAATCCATT ATTTATCAGA TGCTAAATAC 4380

TATTTTTAAC GAAAAAATTA AGAAATTGGA GAACTAATAT GAAACTTGAA AAACATTTGA 4440 

TTAAGCTTAA TAAACAATTT TCTAACAAGG AGGAAGCTAT TTGTTATTGT GGGCAAGTTC 4500

TTTATGAGGG TGGATATGTT AATGAAGACT ATATTGAAGC CATGATTGAG CGAGATAAAG 4560

AGCTATCTGT TTACATGGGT AACTTTATCG CCATACCGCA TGGAACAGAT GCAGCAAAAA 4620

ATGATGTCCT CAAGTCTGGT ATTACAGTCG TTCAAGTCCC TAGAGGGGTT GATTTTGGGA 4680

ATGTATCTAA CCCTCAAGTG GCAACGGTTC TTTTTGGTAT TGCTGGTATT GGTAATGAAC 4740

ACTTAGAAAT TATTCAGAAA ATTTCTATCT TCTGTGCAGA TGTAGATAAT GTTCTTAAAC 4800

TAGCAGATGC TCAGTCAAAA GAGGAAGTAT TGCGCTTATT TGATGCTGTT GAATAATTGA 4860 

ATTTAGTCAT TTGTCATCTA GTATATATGT CCCTCAAATA GGAAAAGGAG AAATTGAATG 4920

AAACATTCTG TTCATTTTGG TGCCGGTAAT ATCGGTCGTG GTTTTATAGG TGAAATTCTA 4980

TTTAAAAATG GTTTCCATAT TGATTTTGTG GATGTCAATA ATCAGATAAT TCATGCTCTG 5040 
ACTAATGTGG CTGGCATTAA TAGCAAAGAA CATCCTGAGC AAGTCATTGA AGCGATTCAA 5160

AAGACGGATA TTATTACTAC TGCAATCGGA CCTAATATAC TCCCTTTTAT CGCCGAACTT 5220

CTAGCCAAAG GAATCGAAGC TCGCCGAGTT GCAGGAAATA CACAGGCATT GGATGTTATG 5280

GCCTGTGAAA ATATGATTGG CGGGTCTCAA TTTCTTTATC AAGAAGTCAA GAAATATTTA 5340 

AGTCCGGAAG GTTTGACATT TGCTGATAAC TACATAGGTT TTCCAAATGC TGCAGTAGAC 5400

AGGATTGTTC CAGCACAAAG TCACGAAGAT TCCCTTTTTG TTGTGGTCGA GCCCTTTAAT 5460 

GAATGGGTCG TGGAAACCAA GCGTCTTAAA AATCCAGATT TACGTCTAAA AGATGTGCAT 5520

TATGAAGAAG ATTTAGAACC CTTTATTGAG CGAAAACTTT TTTCAGTCAA TTCTGGACAT 5580 

GCAACTTCAG CTTACATTGG TGCGCATTAT GGTGCCAAGA CAATTTTGGA AGCTCTTCAA 5640 

AATCCTAATA TTAAATCTCG GATTGAATCT GTATTAGCTG AAATTCGGAG TCTCTTGATT 5700

GCCAAATGGA ACTTTGATAA AAAAGAATTG GAGAATTATC ACAAAGTCAT TATAGAACGA 5760

CTTGAAAACC CTTTCATAGT GGACGAGGTT AGTCGCGTAG CTCGTACTCC AATCCGAAAA 5820

TTAGGCTATA ATGAACGATT CATCCGGCCG ATACGTGAAT TGAAAGAACT CAGTTTGTCA 5880 

TATAAAAACC TACTTAAAAC AGTTGGCTAT GTCTTTGACT ATCGCGATGT AAATGATGAA 5940

GAAAGTATTC GATTAGGTGA ATTGTTGGCT AAACAATCAG TCAAAGATGT TGTTATACAA 6000

GTTACAGGTT TAGACGACCA AGAATTGATT GAGCAAATTG TAGAGTATAT TTAATCTTTT 6060 

TCGAAAATCT CTTCAAATCA GGTTAGCATC GCTTTGTCTT AGGCATATGT TGTTCTATCT 6120

ACAACCTCAA AGCAGTGCTT TGAGCTGACT CCGTCAGTCT TATCTGCAAT CTCAAAACAC 6180

TGTTTGAGTT ATCTGCGGTA ATCTTTCTAG CTTGTCTTTG ATTTTTGTTG TTATTTATAA 6240

GGTAAAAGAA GCTGGACAAA AAGTCTTCAA AATCGGGAAA AGGCAGCCTA TCGGGTGTTC 6300

AAAAATCTTG ATAGGATGTC CTTTATTATG GAAAGCCTTA TTGGATTTTC TCCTCAGATT 6360

GAGTTTTTGA TCAGCTTTAT GAGATAGGTC TTGCTAGAGA TGTAGCCCAT CATGTTATTT 6420

TTATGGACAG TGGGAAAATT GTTGAAAAAA ATAATGCCCA TCAATTCTTT AGTCGTCCAA 6480

GAGAAGAACG AACCAAGCAA TTTTGGAACG AATTCTTTCG AATGCGATCT ATATAGTAAA 6540 

ATGAAACAAG AACAGGACAA ATCGATCAGG ACAGTCAAAT CGATTTCTAA AAATGTTTTA 6600 

GAAGTAGAGG TGTACTATTC TAGTTTCAAT CTACTATATA ACTGAAAAAT TAGATAAATT 6660 

AGTTTTGGAA AATGACTAAC CAAAAGATAT CCAAAGTAGT CTAAAATTGT CTATACTTTA 6720 

TGAGTGTTTT AGTTAGGAAA AAGGCTTGTT GTCTATAATT GTCTGCATTA GTCTAGATTT 6780 

TATTTATAGA AAATGTTATA ATAGACTGTA TTTAAAAAAT TTTAAGGAGA AATGACAGAA 6840
TGTCTGTATC ATTTGAAAAC AAAGAAACAA ACCGTGGTGT CTTGACTTTC ACTATCTCTC 6900

AAGACCAAAT CAAACCAGAA TTGGACCGTG TCTTCAAGTC AGTGAAGAAA TCTCTTAATG 6960

TTCCAGGTTT CCGTAAAGGT CACCTTCCAC GCCCTATCTT CGACCAAAAA TTTGGTGAAG 7020

AAGCTCTTTA TCAAGATGCA ATGAACGCAC TTTTGCCAAA CGCTTATGAA GCAGCTGTAA 7080

AAGAAGCTGG TCTTGAAGTG GTTGCCCAAC CAAAAATTGA CGTAACTTCA ATGGAAAAAG 7140 

GTCAAGACTG GGTTATCACT GCTGAAGTCG TTACAAAACC TGAAGTAAAA TTGGGTGACT 7200 

ACAAAAACCT TGAAGTATCA GTTGATGTAG AAAAAGAAGT AACTGACGCT GATGTCGAAG 7260

AGCGTATCGA ACGCGAACGC AACAACCTGG CTGAATTGGT TATCAAGGAA GCTGCTGCTG 7320

AAAACGGCGA CACTGTTGTG ATCGACTTCG TTGGTTCTAT CGACGGTGTT GAATTTGACG 7380

GTGGAAAAGG TGAAAACTTC TCACTTGGAC TTGGTTCAGG TCAATTCATC CCTGGTTTCG 7440 

AAGACCAATT GGTAGGTCAC TCAGCTGGCG AAACCGTTGA TGTTATCGTA ACATTCCCAG 7500

AAGACTACCA AGCAGAAGAC CTTGCAGGTA AAGAAGCTAA ATTCGTGACA ACTATCCACG 7560

AAGTAAAAGC TAAAGAAGTT CCGGCTCTTG ACGATGAACT TGCAAAAGAC ATTGATGAAG 7620

AAGTTGAAAC ACTTGCTGAC TTGAAAGAAA AATACAGCAA AGAATTGGCT GCTGCTAAAG 7680

AAGAAGCTTA CAAAGATGCA GTTGAAGGTG CAGCAATTGA TACAGCTGTA GAAAATGCTG 7740 

AAATCGTAGA ACTTCCAGAA GAAATGATCC ATGAAGAAGT TCACCGTTCA GTAAATGAAT 7800

TCCTTGGGAA TTTGCAACGT CAAGGGATCA ACCCTGACAT GTACTTCCAA ATCACTGGAA 7860 

CTACTCAAGA AGACCTTCAC AACCAATACC AAGCAGAAGC TGAGTCACGT ACTAAGACTA 7920

ACCTTGTTAT CGAAGCAGTT GCCAAAGCTG AAGGATTTGA TGCTTCAGAA GAAGAAATCC 7980 

AAAAAGAAGT TGAGCAATTG GCAGCAGACT ACAACATGGA AGTTGCACAA GTTCAAAACT 8040

TGCTTTCAGC TGACATGTTG AAACATGATA TCACTATCAA AAAAGCTGTT GAATTGATCA 8100 

CAAGCACAGC AACAGTAAAA TAATCTTAAT AAACAGAAAA CCCACCTGAA TTGGTGGGTT 8160 

TTCTGATGCA CTATTTTCCA AAAATCTCTT TGAGGTCTGT GTCTGTAATC CCAATCATGG 8220

CTGGGATGCG GTCCCAGTTT TCTTCGGTTA GGATGTAGGA TTGTTCAGAG GCACTTGATG 8280

TGACTGTTTC AGAGACAGCT TGTTGCTTTT CTTCAACATT CTCCAGTAGA TCACTGAAGC 8340

GTTCAATCAG ATAGGTTTTT CGGGCAGTTC CGATGTGTTG GGTAGCATAG TCGAAGGCTT 8400

GTAATTCGCC TAGTAAGATG AGTTTGCTTT TGGCACGTGT AATGGCTGTG TAGATGAGAT 8460

TTCGCTCCAG CATACGTCGG CTAGCACTAG TAATCGGTAG GATGACAACT GGGAACTCAC 8520

TTCCCTGAGA CTTATGAATA CTCATGGCAT AGGCCAAGCG AATCTTGTAC CATTCGTTAC 8580 
GGGGGTAAGA GACTTCATTA CCATCAAAAT CAATGACAAT CTCGTCTTGT TTCGATTCGG 8640

TGTATTTACC AGGAATCAGG TCTGTGATAG CTCCTAAATC CCCATTAAAG ACATTGATTT 8700 

CAGCATCGTT AACCAAATGA ATGACCCTGT CTCTCTTACG ATAGTGACAC TGAGGAGCTT 8760

CAAAACTGAG TTGATCTTTT TGTGGGGGAT TGAGCAGGTC TTGCATGAGC TGATTGATAG 8820

CATCAATCCC TGCCGTCCCT CGGTACATAG GAGCCAGAAC TTGGATATCA CGGGCGGGAA 8880 

TACCATTTCT GAGGGCGGCA CCTAAGATTT TTTCAATGGT GGCAGGAATA TGGCCACTAG 8940 

CAATTTCAAA GTAGGAACGG TCAGCTTTTT TTTGGGTGAA ATCAGCTGGC AAGATGCCCT 9000 

GTCGAATCTG ACTAGCTAGG GTGACGATGG TTGATTCTTT GCTTTGTCGA TAAATTTTTT 9060 

CCAAGCGAGT CTGAGGAATC AAAGGAATAT GAAGTAGATC CGCTAGAACC TGTCCAGGAC 9120 

TGACAGAAGG TAGCTGATCA CTGTCACCTA CGATGAGGAT CTTACTGTTA GAAGAGATAT 9180 

TGGAGAAGAG TTGATTGGCC AGCCAAGTAT CTACCATAGA GAATTCATCC ACGATGATAA 9240 

AGTCAGCATC TAGGTAATCT TCCAGATGAC TGGTATCATC GTCACCTGTC ATTCCCAAGT 9300 

GGCGATGTAT GGTCGCGCTA GGCAAACCTG TCAATTCATT CATGCGACGA GCAGCTCGAC 9360 

CAGTTGGAGC AGCAAGAAGA ATGGGCAGAT TGCTTTTCTT CCTGAAGTCA AGTCCTTCTA 9420 

AAAGGGCATA AACAGCAATG ATTCCATTGA TAACAGTTGT CTTACCAGTA CCAGGCCCAC 9480

CTGTCAGGAT AAAGACCTTA TTCTGGATAG CATCACAGAT AGCCTGTTTT TGAATGTTAT 9540 

CATACTCAAT TCCCAGTTCT TGCTCGACAG TAGTGATATG TTTTTGAATG GTTTCTAAAT 9600

CATGACTCTT CTGTTTTCCT TTTTCAAGGA TACGAACCAA GTGACTGCGG ATGCCTTCCT 9660 

CAGCGAAAAA GAGGCTGTTG TCAAAGATCT TGGTATCAAT CTGCTGAACC TTGTCTTCTT 9720 

CGATCAGGTA GGAGAGCTCT TGGGCAACTT GGCTGGGGTC TAGTTCCACG GGACGGGAAG 9780

ACTCAAGGAG AGTAAGGGTT TGTTCCAGCA AATCCCGTGC TTCAACATAG GTGTCCCCTG 9840 

TTTCCATACA GGCCTGAAAA AGACTGTGAA CTAGACCGGC GCGGAAGCGT TCAGGAGCCT 9900

GACTTTCGAT GCCTAGTTCC TCAGCTAGTT GGTCAGCAAT GGTAAAGCCC AAACCCTTGA 9960

TATCCTCAAC CAGCTGGTAG GGATAATTTT CAACCACATC AAGGGTTTCT TCCTTGTAAA 10020

AGTCTTGAAT CTGAAAGGCT AGTTTGTTGG GAATGCCGTA GTTGGCTAGT TTGGCCAAAA 10080 

TCATCTCCGT TCCGTAGTTG AGACGGAGAG TGGAGACGAA AGCCTCGCGA TTTTTGGCAG 10140 

AGAGTCCTGC GATGCCTTCT AACTTTTCTG GGTGTTGCAA AATTTCGTCA ATGGTATTTT 10200 

CGCCATAGGT ATCCACGATT TTCTGAGCTG TCTTGAGACC AATCCCCTTG AAATGGCTAC 10260 

TTGAAAAGTA CTTGACCAAG CCCTTACTAG TTGGTTTTGC GCGATCATAA CGACTGATTT 10320

GCAGTTGTTC TCCATACTTG GAGTGCTGGA CAATTTGCCC CCAAAAAGTA TAGTCTTCGC 10380

CCTCAATTAC ATCAGCCATG GTTCCTGTGA CAATGATTTC AAAATCATCA AAATCCTCTG 10440

CGTCCGTATC GTCGATTTCT AGGAGGAGGA TGCGATAAAA ATTGCTGGGA TTTTCAAAAA 10500 

TAATCCGTTC AATAGTTCCT GAAAAATAAA CTTCCATAAA ATTCCTTTGC ATGAATAGGT 10560 

GAGAGTTGGG ATTGTTTTTA TTTTATACTC TTCGAAAATA TCTTCAAACC ACGTCAGCTT 10620 

CCATCTGCAA CCTCAAAACA GTATTTTGAG CTGACTTCGT CAGTTCTATC CACAACCTCA 10680

AAACACTGTT TTAAGCAGCC TACGGCTAGC TTCCTAGTTT GTTCTTTGAT TTTCATTGAG 10740

TATTTGTAAA TAAACAATCA CTTCTCACGA TAGAAGAAGA GGCTGAGATT GGTGATTCTC 10800 

TGCCTCTTAG GTTTCTTAAA ATGTTCCGAT ACGGGTGATT GGCCATAAGC GGAATTTAGC 10860 

TTCCCCTGTG ATATCTTTTG CTTTGAAGGT ACCTACGTGG CGGCTGTCGC TCGAAACCAA 10920 

GCGGTCATCT CCGAGGAGAA GGTATTCTCC TTCTGGAACA CTAAAGCTAA AGTTGGTGTT 10980 

GTAGTTGACA TCAACTGTGA AGGCTTGAGC TTTTTGAGCG ATACTTCTAA AGAAAGTTCC 11040 

TTTATTTCCT TCAAAGCCCT TGCCTGAGTA AGTGCTTTGG AGTTTGTCAT CCTTGAAGCG 11100 

TTTGATATAG TCTGCTAGAT AAGGCTCGTC CGTTTCTTTG TCATTGATGT AGAGTTTATC 11160 

ATTTTCGTAA CGAATGGTGT CGCCAGGCAT TCCAATCACG CGCTTGACGA TGTCCTTATT 11220 

GCCATCTTCC TCATGGGCCA CCACGATATC AAAACGGTCA ATAGGAAGGT GTTTTACAAC 11280 

GAAGAGAATT TCGCCATCCG CTAGGGTCGG ATCCATGGAA TGTCCTTCTA CGCGAACATT 11340 

GCTCCAAAAA AAGATACGAC TTAAAGCTAG TAATGACAGA ATTAGGAGGA ACAATCCCCA 11400 

CTCTTTTAAG AAATTTTTAA ATGAATTCAT AACTTACCTT TCTAAGCGTT TTTTCGCTTT 11460 

TTCAGTGTTT TTAAAGTGCA ATTTGGCGCA GAAGCTGAGT CCCTGCATAC CATAGGCTTG 11520 

CAAAATCTGG CTAGCCACCT TGTCAGAAGC CGTTCCAGCT CCACTTGGGA GCTGATAACC 11580 

CAGTTCTCGT CCCAAATTTT CAAGATTTTC CAGAAAGAGA TCACGCGCAA TGACAGAAGA 11640 

AACTGCGACA GACAAGTATT TGCCCTCAGC CTTTTCTTCT AAGCTGATAG GATTGCTGAA 11700

ACGATTGGCC TCTTGTGCCA AGTACTTGTC ATAATTTTTA GCACTGGTAA AGGCATCAAT 11760

CACAATTTTC TCAGGCTGAA CACCTTTTTG AAGGAGGAGA TAGATAGCCT GATTATGGAG 11820 

GGCAACCTTA ACCGAAACAG CGTTGTAGCG GTCTCCGATG ACCTCGTTGT ACTTGCTGGG 11880 

TGAGAGAAGG AGTGCCTGGT GCTGAATTTT TTCCTTGAGA ATAGGAGTAA TCTGACGGAT 11940 

CTTTTGGTCG GTCAGAGTCT TAGAATCCCC CACACCGAGT TTTCGTAAAA AGTCGTGCTG 12000

GTCAGGTGTG ACAAAGGCAG CCACAACTGC AAGCCCACCA AAGTAGGAAC CATTTCCCAC 12060

CTCATCTGTC CCAATTAAAG GAAGATTTTG TCCGCTGGTT TGCTCTACAG CTTGATAGCC 12120

AAAGAAACTG GCGTATTTTT CAGCCCCTTC ACCCTGAAGC AAGATTTTTC CAGAAGTATA 12180

GATAGAAACC GTTGCTTGAG GTAGTTTCAA AAAGTAGCGG ATATAGGGAT TCTTGCTAGG 12240

AGCCAGACTG GTTTGATAGT GTTCAAGAAA AGCCTGAATA TCCTTTTCGC TTGGTGTGAG 12300

TGTGATACTT GCCATAGTTT CTATTGTACC ACAAAAGCAG TAAAATTTGT AAAAACTGAC 12360

AAAATTAGCG AATTTTGGTA TAATATCGTG AGGTGAATTT TATGGCAAAT CTAAATCGAT 12420

TCAAATTTAC ATTCGGGAAA AAATCGTTAA CCTTGACAAG CGAACATGAC AACCTTTTTA 12480

TGGAGGAAAT CGCTAAGGTT GCGACAGAAA AATACCAAGC AATTAAAGAA CAAATGCCTA 12540

GCGCAGATGA TGAAACAATC GCTCTTTTGT TGGCAGTCAA CTGTTTATCA ACTCAGCTCA 12600

GCCGTGAGAT TGAATTTGAC GATAAGGAGC AAGAGCTAGA AGAACTCCGT CACAAGCTTG 12660 

TGACTTGTAA GCAAGAACAG AGCAAGATTG AGGATTCCTT ATGATTTCAT TCCTTCTTCT 12720

ATTGGTCTTG GTTTGGGGAT TTTATATCGG CTATCGGAGA GGCCTGCTCT TACAGGTTTA 12780

TTACCTGATT TCAGCCATGG CATCGGCTTT TATGGCTGGC CAGTTTTATA AGGGGCTTGG 12840

AGAGCAATTC CATTTATTGC TCCCTTATGC AAATTCGCAG GAAGGTCAGG GGACTTTCTT 12900

TTTCCCATCG GATCAACTCT TTCAGCTGGA TAAGGTCTTT TATGCAGGTA TCGGCTACTT 12960 

GCTTGTATTT GGGATTGTCT ATAGCATTGG TCGTTTACTT GGTCTTCTCT TACACTTGAT 13020

TCCTAGCAAA AAACTGGGTG GTAAGTTGTT CCAAGTTTCA GCAGGTATCT TGTCCATGTT 13080

GGTGACCTTA TTTGTCTTGC AAATGGCCTT GACAATCTTG GCGACCATCC CCATGGCAGT 13140 

TATACAAAAT CCTCTTGAAA AGAGTATCGT CGCAAAACAC ATCATCCAGA GCATACCGGT 13200

AACAACCAGT TGGCTCAAAC AAATCTGGGT GACAAATTTA ATCGGATAAA AAGGGCAGGA 13260

GTTTTCCTAG CCCTTTGTTT ACAGATTTGA CTCGAATCTA TCAGAATGTA AAAAGCTACC 13320 

ACACCTAGAC ATTCAAAGAC AAGGAAATAA AGATGAATAA GAAAATATTA GAAACATTAG 13380

AGTTCGATAA GGTCAAGGCC TTGTTTGAGC CTCATTTGTT GACCGAGCAG GGCTTGGAGC 13440 

AATTGAGACA ACTGGCTCCG ACTGCCAAAG CAGATAAAAT CAAACAGGCT TTTGCTGAGA 13500 

TGAAGGAAAT GCAGGCTCTT TTCGTCGAGC AACCGCATTT TACTATTCTC TCAACTAAGG 13560

AAATTGCAGG AGTCTGCAAG AGGTTGGAGA TGGGAGCGGA TCTCAATATC GAGGAGTTCC 13620

TACTCTTGAA ACGCGTGCTT CTTGCCAGCC GAGAACTTCA AAATTTTTAC ACCAATCTGG 13680 

AAAATGTCAG CTTGGAAGAA TTAGCCCTTT GGTTTGAGAA ATTACATGAT TTTCCGCAAT 13740

TACAAGGAAA TCTTCAGGCC TTTAATGATG CGGGTTTCAT TGAAAATTTT GCCAGTGAAG 13800 

AATTGGCGCG AATCCGTCGA AAAATACATG ATAGCGAGAG TCAGGTACGC GATGTTTTAC 13860

AAGACTTGCT CAAGCAAAAA GCGCAGCTGT TGACGGAAGG AATTGTTGCT AGCAGAAATG 13920

GCCGTCAGGT TTTACCAGTC AAAAACACCT ACCGCAATAA GATTGCAGGT GTCGTTCATG 13980

ATATTTCTGC TAGTGGAAAC ACCGTCTATA TCGAACCCCG TGAGGTAGTC AAACTGAGCG 14040 

AAGAAATTGC TAGTCTGCGA GCAGATGAGC GCTATGAAAT GCTTCGCATT CTCCAAGAAA 14100

TTTCTGAGCG TGTCCGCCCT CATGCGGCTG AGATTGCTAA TGACGCTTGG ATTATCGGTC 14160 

ATCTGGACTT GATTCGTGCC AAGGTTCGAT TTATCCAAGA AAGACAAGCA GTCGTGCCTC 14220 

AGCTGTCAGA AAATCAAGAG ATTCAACTGC TCCATGTCTG CCATCCTTTG GTCAAAAATG 14280 

CCGTCGCAAA TGATGTCTAT TTTGGTCAAG ATTTAACAGC TATTGTCATT ACAGGTCCCA 14340

ATACAGGTGG GAAGACCATC ATGCTCAAAA CTCTGGGCTT GACACAGGTC ATGGCCCAGT 14400

CAGGATTGCC GATTTTAGCA GACAAGGGAA GTCGTGTTGG TATTTTTGAA GAAATCTTTG 14460 

CTGATATTGG AGATGAGCAG TCTATTGAGC AGAGCTTGTC TACCTTCTCT AGTCATATGA 14520

CCAATATCGT GGATATTCTT GGCAAGGTCA ACCAACATTC ACTCTTACTT TTGGATGAGT 14580

TGGGGGCTGG TACTGATCCC CAAGAGGGAG CAGCCCTTGC CATGGCTATT CTGGAGGACC 14640

TTCGCCTGCG TCAAATCAAG ACCATGGCGA CGACCCACTA TCCAGAACTC AAGGCCTACG 14700

GTATTGAGAC AGCCTTTGTG CAAAATGCCA GTATGGAGTT TGATACTGCA ACTCTTCGCC 14760

CGACCTATCG CTTTATGCAG GGTGTTCCTG GCCGAAGTAA TGCCTTTGAA ATTGCCAAAC 14820

GTCTAGGCCT ATCTGAAGTT ATCGTAGGAG ATGCCAGTCA GCAGATCGAT CAGGACAATG 14880

ACGTCAATCG TATCATTGAG CAATTAGAAG AGCAGACGCT GGAAAGCCGC AAACGTTTGG 14940

ACAATATCCG TGAGGTGGAG CAAGAAAATC TCAAGATGAA CCGTGCGCTA AAAAAACTCT 15000 

ACAACGAGCT TAATCGTGAA AAGGAAACCG AGCTTAACAA GGCGCGTGAA CAGGCTGCTG 15060 

AGATTGTGGA TATGGCCCTA AGTGAAAGTG ACCAGATTCT CAAAAATCTC CACAGTAAAT 15120

CCCAACTCAA GCCCCACGAA ATCATTGAAG CCAAGGCCAA GTTGAAAAAA TTGGCTCCTG 15180 

AAAAAGTGGA CTTGTCTAAA AATAAGGTCC TTCAAAAGGC CAAGAAAAAA CGAGCTCCAA 15240

AGGTGGGAGA TGATATCGTG GTTCTCAGTT ATGGTCAGCG TGGTACCTTG ACCAGTCAAC 15300

TCAAGGACGG TCGCTGGGAA GCCCAAGTTG GCTTGATTAA GATGACCTTG GAAGAGAAAG 15360

AGTTTGATCT TGTTCAAGCC CAGCAAGAAA AACCAGTCAA GAAGAAACAG GTCAATGTTG 15420 

TGAAACGAAC TTCTGGGCGA GGACCTCAAG CTAGACTGGA TCTTCGAGGC AAGCGCTATG 15480

AAGAAGCCAT GAATGAGCTA GATACCTTCA TCGACCAAGC CTTGCTTAAC AATATGGCTC 15540 

AAGTTGATAT CATCCATGGT ATCGGAACAG GAGTCATCCG TGAAGGAGTT ACCAAATACT 15600

TGCAAAGAAA CAAACATGTC AAGAGTTTCG GCTATGCCCC ACAAAATGCT GGAGGCAGTG 15660

GTGCGACTAT TGTCACTTTT AAAGGATAGC AGTATTCTGG ACTTTATAAA GTAAAAACTG 15720

TTGAACTAAT TTTTACTAAT AAACACATTG ACAAAAGCCA ACATTTTTTG TAAAATTAGA 15780

ATCAATTAAA TACCAACACC GAATGAAGTT TAATAGAAGT GGGGAATCGT TTGATTTTCC 15840 

ATGACTGTAA ATGGACGGAA CTCTGGAGAG ACCGTAAAGG CACCGAAGGG CAAGGCAGGC 15900 

AACTGCTCAA ACTCTCAGGT AAAAGGACAG AGCTAGGATA GACCGCTTTT TAGCATTTAT 15960

CTAAGCATTC CAGAGTACAT GTATCTTGCA TGTGCTCTTT CTTTTGGGGT TGAAACGATA 16020

GGAGAAGGAA ATGTTAGAAT TGCTTAAATC AATCGATGCT TTTGCTTGGG GACCGCCCCT 16080 

CTTGATTTTA TTGGTCGGAA CAGGGATTTA CCTAACTATT CGGCTAGGAC TCTTGCAGGT 16140 

TTTGCGTCTA CCCAAGGCCT TTCAGCTTAT TTTTATCCAG GATAAGGGAC ATGGTGATGT 16200 

ATCCAGTTTT GCAGCTCTGT GTACAGCCTT GGCATCAACT GTTGGAACAG GAAATATCAT 16260 

AGGAGTTGCG ACGGCTATCA AGGTTGGTGG ACCAGGAGCT CTATTTTGGA TGTGGATGGC 16320 

GGCTTTCTTT GGAATGGCTA CCAAGTATGC GGAAGGACTC TTGGCCATCA AATACCGCAC 16380 

CAAGGACGAC CATGGTGCAG TAGCGGGAGG TCCCATGCAT TATATCCTTC TAGGGATGGG 16440 

AGAAAAGTGG CGACCACTTG CTGTTTTGTT TGCAGTAGCA GGAGTATTGG TTGCTCTCTT 16500 

GGGAATCGGA ACCTTCACCC AAGTCAACTC GATTGCAGAA TCTATCCAAA ATACAACGAC 16560 

GATTTCGCCA GCCATCACAG CTCTCGTCTT GTCTGTCTTT GTAGCGATTG CAGTCTTTGG 16620

TGGACTCAAG TCTATTTCTA AGGTTTCAAC TACTGTTGTT CCTTTTATGG CCATCATTTA 16680 

TATCTTAGGA ACTCTTACAG TTATTTTCTT TAATATCGGA AAAATCCCTG GCACAATCGC 16740 

TTTAGTCTTT ACCTCAGCTT TTAGTCCCCT TGCTGCGGTA GGTGGATTTG CTGGTGCTAG 16800

CGTTCGGATG GCTATTCAAA ATGGTCTGCC GCGTGGTGTG TTCTCAAACG AATCTGGTCT 16860 

GGGTTCTGCT CCTATTGCAG CTGCAGCTGC CAAGACAAAT GAACCAGTAG AGCAAGGTTT 16920 

GATTTCCATG ACAGGAACCT TTATTGATAC CCTCATCATT TGTACTCTAA CTGGTTTGAC 16980 

CATCTTGGTA ACTGG 16995
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28473 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:
CCGGGGCTTT TGTAGTATAA TAGAGATACG TTTTGAAAGT AGGAGGTATC TATGGACTTA 60

ACTAAGCGCT TTAATAAACA GTTAGATAAA ATTCAAGTTT CGTTGATTCG TCAGTTTGAC 120

CAGGCTATTT CGGAGATTCC TGGGGTCTTG CGTTTGACCT TGGGGGAACC TGATTTTACA 180 

ACGCCAGACC ATGTCAAGGA GGCGGGCAAG CGAGCGATTG ATCAGAACCA ATCCTACTAT 240 

ACAGGGATGA GTGGTCTGCT GACTCTACGT CAGGCAGCCA GTGACTTTGT TAAGGAAAAG 300

TACCAACTGG ACTATGCTCC TGAAAATGAA ATCTTGGTTA CAATTGGGGC GACAGAGGCT 360

TTATCTGCGA CTTTGACGGC TATTTTGGAA GAGGGAGACA AGGTACTTTT GCCAGCTCCT 420 

GCTTATCCAG GCTATGAACC GATTGTTAAC TTAGTTGGGG CAGAAATTGT TGAGATTGAT 480

ACGACTGAAA ATGGTTTTGT CTTGACTCCT GAGATGTTGG AGAAGGCCAT TTTGGAGCAG 540 

GGTGATAAGC TCAAGGCGGT TATTCTCAAC TATCCAGCCA ATCCGACAGG AATTACCTAC 600 

AGTCGAGAGC AGTTAGAGGC CTTGGCAGCT GTTTTACGCA AGTACGAAAT TTTTGTTGTC 660 

TGTGATGAGG TTTACTCAGA ATTGACCTAC ACAGGCGAAG CCATGTGTCT CTAGGAACGA 720 

TGTTGAGAGA CCAGGCTATT ATTATCAATG GTTTGTCTAA ATCGCATGCC ATGACAGGTT 780

GGCGTTTGGG GCTGATTTTC GCTCCTGCGA CCTTCACAGC CCAGTTAATC AAGAGTCACC 840 

AGTACTTGGT CACTGCCGCA AATACCATGG CGCAACATGC TGCGGTAGAA GCCTTGACGG 900 

CTGGTAAAAA CGATGCGGAC CCATGAAGAA GGAATATATC CAACGTCGGG ACTATATCAT 960 

CGAAAAAATG ACTGCTCTTG GTTTTGAGAT TATCAAACCA GACGGTGCCT TCTATATTTT 1020 

TGCTAAAATT CCAGCGGGCT ACAATCAAGA CTCCTTTGCT TTTCTGAAGG ATTTTGCTCA 1080 

GAAGAAGGCC GTTGCCTTTA TCCCTGGTGC AGCCTTTGGA CGTTACGGGG AAGGCTACGT 1140 

CCGCCTATCT TATGCAGCCA GCATGGAGAC TATCAAAGAA GCCATGAAAC GACTTGAGGA 1200

GTACATGAGA GAAGCATGAT TCAGTCTATC ACGAGTCAAG GCTTGGTGCT TTACAATCGC 1260

AATTTTCGTG AGGATGACAA GCTCGTCAAA ATTTTTACAG AGCAGGTTGG CAAACGCATG 1320 

TTTTTTGTCA AACACGCTGG TCAGTCTAAG CTGGCCCCTG TTATTCAGCC CTTGGTGCTG 1380

GCACGATTTC TCTTGCGAAT CAATGATGAC GGACTCAGTT ACATCGAAGA CTATCATGAG 1440 

GTCATGACTT TTCCCAAGAT TAATAGTGAC CTCTTTGTCA TGGCCTATGC GACCTATGTG 1500 

GCAGCTCTTG CAGATGCTAG TTTGCAGGAC AATCAGCAGG ATGCTCCCTT GTTTGCTTTT 1560

TTGCAAAAGA CTTTGGAGTT GATGGAAGCA GGCTTGGATT ATCAGGTTTT GACCAATATT 1620

TTTGAAATTC AAATTTTGAC TCGATTTGGA ATCAGCCTCA ATTTTAATGA GTGTGTCTTC 1680

TGCCATCGGG TTGGTCAGGC TTTTGACTTT TCTTTCAAAT ATGGAGCCTG CCTCTGTCCA 1740 

GAGCATTATC ATGAGGATAA GAGACGTTGT CATCTCAATC CCAATATCCC CTATCTGCTC 1800

AATCAATTTC AAGCTATTGA TTTTGAGACT TTGGAGACCA TTTCGCTCAA GCCTGGAATC 1860

AAGCAAGAGC TACGCCAATT TATGGATCAA TTATATGAAG AGTACGTTGG GATTCACCTA 1920 

AAATCAAAGA AATTTATTGA TTCCCTAGCA GACTGGGGAC AATTACTAAA AGAGGAAAAG 1980 

AAATGAAAAA AATCGCAGTA GATGCCATGG GGGGCGATTA CGCACCTCAG GCCATTGTTG 2040

AGGGTGTCAA TCAAGCCCTA TCTGACTTTT CAGATATCGA GGTTCAACTT TACGGAGATG 2100 

AAGCTAAAAT CAAGCAATAT CTGACAGCGA CAGAGCGCGT CAGCATTATC CATACGGATG 2160 

AGAAGATTGA TTCGGATGAT GAACCTACGA GAGCTATTCG GAATAAGAAA AATGCCAGTA 2220

TGGTATTGGC AGCCAAGGCT GTCAAAGATG GTGAAGCAGA CGCTGTCCTT TCGGCTGGGA 2280

ATACAGGTGC CTTGTTGGCA GCAGGATTCT TCATCGTGGG TCGTATCAAG AATATCGACC 2340

GTCCTGGACT CATGTCTACC TTGCCTACCG TTGATGGAAA AGGTTTTGAC ATGCTAGACC 2400

TTGGTGCCAA TGCAGAAAAT ACAGCCCAGC ACCTCCATCA ATATGCGGTT CTAGGTTCCT 2460

TCTATGCTAA AAATGTCCGT GGCATTGCGC AACCACGCGT TGGTTTGCTC AACAACGGAA 2520

CAGAGAGTAG CAAGGGCGAC CCGCTTCGTA AGGAAACTTA TGAATTACTG GCGGCTGATG 2580

AAAGTTTGAA CTTTATCGGA AACGTGGAAG CGCGTGATTT GATGAATGGC GTTGCAGATG 2640

TTGTTGTGGC AGATGGTTTC ACGGGAAACG CTGTGCTCAA ATCCATCGAA GGGACAGCTA 2700 

TGGGAATCAT GGGCTTGCTC AAGACAGCTA TTACAGGTGG TGGTCTTCGA GCGAAACTAG 2760

GTGCCCTCCT TCTCAAGGAC AGCCTCAGTG GTTTGAAAAA ACAGCTCAAT TATTCAGATG 2820

TTGGTGGAGC GGTCTTGTTT GGTGTTAAGG CACCTGTTGT CAAGACTCAT GGCTCAAGCG 2880 

ATGCCAAGGC TGTTTATAGT ACGATTCGTC AGATCCGTAC CATGCTAGAA ACAGACGTGG 2940 

TTGCCCAGAC TGCGCGTGAA TTTTCAGGAG AATAAAAGAG ATGACAGAAA AAGAAATTTT 3000 

TGACCGTATT GTGACCATTA TCCAAGAGCG ACAGGGAGAG GACTTTGTCG TGACAGAATC 3060 

CTTGAGTCTG AAAGACGATT TGGATGCGGA TTCTGTTGAC TTGATGGAGT TTATCTTGAC 3120

TCTGGAAGAT GAATTTAGTA TCGAAATCAG CGATGAAGAA ATTGACCAAC TCCAAAAC&? 3180 

AGGAGATGTG GTTAAAATCA TTCAAGGAAA ATAGCAATCG GAGTTCCAAG TCAACGGAAG 3240

TAGATGGTTT TTAGAAATGA GAAATATCGG ACAAGCTGGT AAAATCTTGG CTGACAGTGG 3300 

TTATCAAGGG CTCATGAAGA TATATCCTCA AGCACAAACT CCACGTAAAT CCAGCAAACT 3360

CAAGCCGCTA ACAGTTGAAG ATAAAGCCTG TAATCATGCG CTATCTAAGG AGATAAGCAA 3420 

GGTTGAGAAT ATCTTTGCCA AAGTAAAAAC GTTTAAAATG TTTTCAACAA CCTATCGAAA 3480 

TCATCGTAAA CGCTTCGGAT TACGAATGAA TTTGATTGCT GGTATTATCA ATCATGAACT 3540

AGGATTCTAG TTTTGCAGGA AGTCTAATAG TAAAAAAGTG ATTAGAAAAC ATCTTTTTTA 3600

AAAATAGAGA TGATTTTGAA ACAAAAAAGC TAATTCAAGA CGTTTCGATG CCAATTCAAG 3660

ATTTGGATGA AAAAAATTAA TAGATACTGT TATACTAAAC TTGTCAAGTT TGTAACAAGA 3720 

CAAATATTAA AAATAAAAAA GAGGTATTCG TTATGAATAC AAAAACGATG TCACAATTTG 3780

AAATTATGGA TACTGAGATG CTTGCTTGCG TTGAAGGTGG CGGATGCAAT TGGGGAGATT 3840

TTGCCAAAGC AGGTGTTGGA GGAGGAGCAG CACGAGGTCT TCAGCTAGGA ATTAAAACAA 3900 

GAACATGGCA AGGTGCAGCA ACTGGTGCTG TGGGAGGAGC TATACTTGGA GGTGTGGCCT 3960 

ATGCAGCGAC ATGTTGGTGG TAATTATGGA TTTTAAAAGT TTTATTATTG GTTTAGTAGT 4020

TGGTATATTT GGTCCTTATA TGGATCATTT AATTAGAAAA AAATTTTTAA AGTCTTCGGA 4080

GAAGAAAACA GAAAAATCTG TTAAAAAATA ATCAAAACTA TAAATGATGA ATCTGAATCA 4140

AAATTATTTT GCGCATGTAA AGAGGAGTCT TATAGTAACG AGTCAAAAAA GGAGTAACTA 4200 

TGAATCGTAA TTTAGAACGG TGTTATCTAT TCTGACTAGG AATAGATCAT ACCAGAGGTA 4260

GCTTAGAAAT AGCAGAGACA TTAGAAATTG AAGTAATAAA TAGGATGTCG TAAGTGTTAC 4320

TATCAATGAT TTATTTGTTT CAAGCTTGCC TAGGGTGACA GTAAAAAATC AATTTCCTTT 4380 

CAATAGCATA TTTTTAGTGG GCAGGACTCT TGTTCTGCCT ATTTTTTTAT CCAAAAAGTG 4440

CAGTTGGGAG GGAGATAGGC TCATTTGGGA AGGAAGTCCA GTTTTTGTTT AGTGATTGGG 4500 

GTAAGATAGT TGTTATCAGA TGAGTTAATA CTCTTCGAAA ATCAAATTCA AACCACGTCA 4560 

ACGTCGCCTT GCCGTATATA TGTGACTGAC TTCGTCAGTC CTATCTACAA CCTCAAAACA 4620 

GTGTTTTGAG CAGCCTACGG CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATTA 4680 

GGGAAAAGGA GATGAATATG AAATTTGGGA AACGTCATTA TCGTCCGCAG GTGGATCAGA 4740 

TGGACTGCGG TGTAGCTTCA TTAGCCATGG TTTTTGGCTA CTATGGTAGT TATTATTTTT 4800

TGGCTCACTT GCGAGAATTG GCTAAGACGA CCATGGATGG GACGACGGCT TTGGGCTTGG 4860 

TCAAGGTGGC AGAGGAGATT GGTTTTGAGA CGCGAGCCAT TAAGGCAGAT ATGACGCTTT 4920

TTGACTTGCC GGATTTAACT TTTCCTTTTG TTGCCCATGT GCTTAAGGAA GGGAAATTGC 4980

TCCACTACTA TGTGGTGACT GGGCAGGATA AGGATAGCAT TCATATTGCC GATCCAGATC 5040 

CCGGGGTGAA GTTGACTAAA CTGCCACGTG AGCGTTTTGA GGAAGAATGG ACAGGAGTGA 5100 

CTCTTTTTAT GGCACCTAGT CCAGACTATA AGCCTCATAA GGAACAAAAA AATGGTCTGC 5160 

TCTCTTTTAT CCCTATATTA GTGAAGCAGC GTGGCTTGAT TGCCAATATC GTTTTGGCAA 5220 

CACTCTTGGT AACCGTGATT AACATTGTGG GTTCTTATTA TCTGCAGTCT ATCATTGATA 5280 

CCTATGTGCC AGATCAGATG CGTTCGACAC TAGGGATTAT TTCTATTGGG CTAGTCATCG 5340

TCTACATCTT CCAGCAAATC TTGTCTTACG CTCAGGAGTA TCTCTTGCTT GTTTTGGGGC 5400

AACGCTTGTC GATTGACGTG ATTTTGTCCT ATATCAAGCA TGTTTTTCAC CTCCCTATGT 5460

CCTTCTTTGC GACACGCAGG ACAGGGGAGA TCGTGTCTCG TTTTACAGAT GCTAACAGTA 5520

TCATCGATGC GCTGGCTTCG ACCATCCTTT CGATTTTCCT AGATGTGTCA ACGGTTGTCA 5580 

TTATTTCCCT TGTTCTATTT TCACAAAATA CCAATCTCTT TTTCATGACT TTATTGGCGC 5640

TTCCTATCTA CACAGTGATT ATCTTTGCCT TTATGAAGCC GTTTGAAAAG ATGAATCGGG 5700 

ATACCATGGA AGCCAATGCG GTTCTGTCTT CTTCTATCAT TGAGGACATC AACGGTATTG 5760

AGACTATCAA GTCCTTGACC AGTGAAAGTC AGCGTTACCA AAAAATTGAC AAGGAATTTG 5820

TGGATTATCT GAAGAAATCC TTTACCTATA GTCGAGCAGA GAGTCAGCAA AAGGCTCTGA 5880 

AAAAGGTTGC CCATCTCTTG CTTAATGTCG GCATTCTCTG GATGGGGGCT GTTCTGGTCA 5940

TGGATGGCAA GATGAGTTTG GGGCAGTTGA TTACCTATAA TACCTTGCTG GTTTACTTTA 6000 

CTAATCCTTT GGAAAATATC ATCAATCTGC AAACCAAGCT TCAGACAGCG CAGGTTGCCA 6060 

ATAACCGTCT AAATGAAGTG TATCTAGTAG CTTCTGAGTT TGAGGAGAAG AAAACAGTTG 6120 

AGGATTTGAG CTTGATGAAG GGAGATATGA CCTTCAAGCA GGTTCATTAC AAGTATGGCT 6180 

ATGGTCGAGA TGTCTTATCG GATATCAATT TAACCGTTCC CCAAGGGTCT AAGGTGGCTT 6240 

TTGTGGGGAT TTCAGGGTCA GGTAAGACGA CTTTGGCCAA GATGATGGTT AATTTTTACG 6300 

ACCCAAGTCA AGGGGAGATT AGTCTGGGTA GTGTCAATCT CAATCAGATT GATAAAAAAG 6360 

CCCTGCGCCA GTACATCAAC TATCTGTCTC AACAGCCCTA TGTCTTTAAC GGAACGATTT 6420 

TGGAGAATCT TCTTTTGGGA GCCAAGGAGG GGACGACACA GGAAGATATC TTACGGGCGG 6480 

TCGAATTGGC AGAGATTCGA GAGGATATCG AGCGCATGCC ACTGAATTAC CAGACAGAAT 6540 

TGACTTCGGA TGGGGCAGGG ATTTCAGGTG GTCAACGTCA GAGAATCGCT TTGGCGCGTG 6600 

CTCTCTTGAC AGATGCGCCG GTCTTGATTT TGGATGAGGC GACTAGCAGT TTGGATATTT 6660

TGACAGAGAA GCGGATTGTC GATAATCTCA TTGCTTTGGA CAAGACCTTG ATTTTCATTG 6720 

CTCACCGCTT GACTATTGCT GAGCGGACAG AGAAGGTAGT TGTCTTGGAT CAGGGCAAGA 6780 

TTGTCGAAGA AGGAAAGCAT GCTGATTTGC TTGCACAGGG TGGCTTTTAC GCCCATTTGG 6840 

TCAATAGCTA GAAAGAGGAG AGGATGAAAC CAGAATTTTT AGAAAGTGCG GAGTTTTATA 6900

ATCGTCGTTA CCATAATTTT TCCAGTAGTG TGATTGTACC CATGGCCCTT CTGCTTGTGT 6960 

TTTTACTTGG CTTTGCAACT GTTGCAGAGA AGGAGATGAG TTTGTCCACT AGAGCTACTG 7020

TCGAACCTAG TCGTATCCTT GCAAATATCC AGTCAACTAG CAACAATCGT ATTCTTGTCA 7080

ATCATTTGGA AGAAAATAAG CTGGTTAAGA AGGGGGATCT TTTGGTTCAA TACCAAGAAG 7140

GGGCAGAGGG TGTCCAAGCG GAGTCCTATG CCAGTCAGTT GGACATGCTA AAGGATCAAA 7200

AAAAGCAATT GGAGTATCTG CAAAAGAGCC TGCAAGAAGG GGAGAACCAC TTTCCAGAGG 7260

AGGATAAGTT TGGCTACCAA GCCACCTTTC GCGACTACAT CAGTCAAGCA GGCAGTCTTA 7320

GGGCTAGTAC ATCGCAACAA AATGAGACCA TCGCGTCCCA GAATGCAGCA GCTAGCCAAA 7380 

CCCAAGCCGA AATCGGCAAC CTCATCAGTC AAACAGAGGC TAAAATTCGC GATTACCAGA 7440 

CAGCTAAGTC AGCTATTGAA ACAGGTGCTT CCTTGGCCGG TCAGAATCTA GCCTACTCTC 7500

TTTACCAGTC CTACAAGTCT CAGGGCGAGG AAAATCCCCA AACTAAGGTT CAGGCAGTTG 7560

CACAGGTTGA AGCACAGATT TCTCAGTTAG AATCTAGTCT TGCTACTTAC CGTGTCCAGT 7620

ATGCAGGTTC AGGTACCCAG CAAGCCTATG CGTCAGGGTT AAGCAGTCAA TTGGAATCCC 7680

TTAAATCCCA ACACTTGGCA AAGGTTGGTC AGGAATTGAC CCTTCTAGCC CAGAAAATTT 7740

TGGAGGCAGA GTCAGGTAAG AAGGTACAGG GAAATCTTTT AGACAAGGGG AAAGTTACGG 7800

CGAGTGAGGA TGGGGTGCTT CATCTTAATC CTGAGACCAG TGATTCTAGC ATGCTTGCAG 7860

AAGGTGCCCT ACTAGCCCAA CTTTATCCAT CTTTGGAAAG AGAAGGGAAA GCCAAACTCA 7920

CAGCTTATCT AAGTTCAAAA TATGTAGCAA GAATCAAGGT CGGTGATTCT GTTCGCTATA 7980

CTACGACTCA TGATGCCGGG AATCAACTTT TCCTAGATTC TACTATTACA AGTATTGATG 8040 

CGACAGCTAC TAAGACTGAG AAAGGGAATT TCTTTAAAAT CGAGGCGGAG ACTAATCTAA 8100 

CTTCGGAGCA GGCTGAAAAA CTTAGGTACG GGGTGGAAGG CCGCTTGCAG ATGATTACGG 8160 

GCAAGAAAAG TTACCTACGT TATTATTTGG ATCAATTTTT GAACAAAGAG TAATGTTCGT 8220 

GTTTTTAGAG TTAAATAATT TTTAAACTGT GAGAAAGATT CTTCTTGCAG TTTTTTCTTT 8280

ACAATTTTTG AAAAACATCT ACTATTTATT CGGTTAAATT CTTGTGTTTT TTGGTTTTTT 8340 

GTGGTAAAAT GTGCTCAAGT AATACGAAAG GCGAACTTTA AAATGTCAAA ACAATTGATC 8400

TATTCGGGAA AAGCTAAAGA TATCTATACA ACTGAGGATG AAAATCTTAT TATTTCAACT 8460

TACAAGGACC AGGCGACTGC TTTCAACGGT GTCAAGAAGG AGCAGATTGC AGGTAAGGGA 8520 

GTCTTGAATA ATCAGATCTC ATCTTTTATT TTTGAGAAAT TAAATGTGGC TGGTGTGGCG 8580

ACTCACTTTG TGGAGAAACT TTCAGACACG GAACAACTCA ATAAAAAGGT TAAGATTATT 8640

CCTTTGGAAG TCGTGCTCCG CAACTATACT GCTGGTTCCT TTTCAAAACG TTTTGGTGTG 8700

GATGAGGGAA TCGCCTTGGA GACTCCGATT GTCGAATTTT ACTACAAAAA TGATGATTTG 8760

GATGATCCAT TTATCAATGA TGAGCATGTG AAATTCCTAC AGATTGCGGG TGACCAGCAG 8820

ATTGCCTACT TGAAGGAAGA AACGCGTCGT ATCAATGAAC TATTGAAAGT CTGGTTTGCT 8880

GAGATTGGGC TTAAATTGAT TGACTTTAAG CTAGAGTTCG GTTTTGACAA GGATGGCAAG 8940

ATTATCTTGG CAGACGAATT TTCACCAGAT AACTGCCGCT TGTGGGACGC TGATGGCAAC 9000 

CACATGGATA AGGATGTTTT CCGTAGAGGA TTGGGAGAAC TAACCGACGT TTATGAGATT 9060 

GTTTGGGAAA AGTTGCAGGA ATTGAAATAA TCTGTTTGCA ACGGAAAACC TTCGTCTCTC 9120

AACTAAAAGG ACTCAGGCTG AAAAGGTCCC CCAGACCTTT TCACTCTGTA GAGAACTAGG 9180 

TGAACTAACA GATGTTTACG AAATTGTCTG GGAAAAGTTG CAGGGTTTAA AATAACAACC 9240

TCAAGGCTGT TTGGGAATAT TGCAAGAGCT GAAATAAAGG AATAAGAATT GATGGATAAA 9300 

CGTATTTTTG TTGAAAAAAA GGCTGATTTT CAGGTCAAGT CAGAGAGTTT GGTTAGAGAG 9360

CTCCAGCACA ACTTGGGACT GTCAAGCTTG AAAAGTATTC GTATTGTGCA AGTATATGAT 9420

GTATTTGACT TGGCTGAGGA CTTGTTTGCA CCTGCAGAGA AGCACATTTT CTCTGAGCAG 9480 

GTAACCGACC ATGTTTTAGA TGAAGTATCT GTGCAGGCGG ATCTTGCTAA CTATGCTTTC 9540

TTTGCCATTG AAAGTCTGCC AGGGCAGTTT GACCAGCGTG CAGCTTCGTC ACAGGAAGCC 9600

TTGCTTTTGT TGGGAAGTTC GAGTGACGTG ACAGTCAACA CAGCCCAACT TTACTTGGTG 9660 

AATAAAGATA TTGATGCGAC TGAGTTGGAA GCTGTCAAAA ACTACCTGCT CAATCCAGTT 9720

GATTCTCGTT TCAAGGATAT CACGACAGGG ATTGCCAAGC AGGAGTTTTC AGAGTCAGAC 9780

AAGACCATTC CCAAATTGAC TTTCTTTGAA AGCTATGCAG CAGAAGACTT TGCTCGCTAC 9840

AAGGCCGAAC AAGGGATGGC CATGGAAGTG GATGATTTGC TCTTTATCCA AGACTACTTT 9900 

AAGTCAATCG GGCGCGTGCC AACTGAGACT GAACTCAAGG TTTTGGACAC TTACTGGTCT 9960 

GACCACTGCC GTCATACGAC TTTTGAGACA GAGTTGAAAC ACATCGACTT TTCAGCTTCT 10020 

AAATTTCAAA AGCAATTGCA GTCAACCTAT GACAAGTATA TTGCCATGCG CGAGGAATTA 10080 

GGTCGGTCTG AAAAACCACA AACCTTGATG GATATGGCGA CTATTTTCGG TCGTTATGAG 10140 

CGTGCTAATG GACGATTGGA TGATATGGAA GTCTCTGACG AAATCAATGC CTGCTCAGTT 10200 

GAAATTGAAG TGGACGTTGA TGGTGTCAAG GAACCTTGGC TCCTCATGTT TAAAAACGAA 10260 

ACCCACAACC ATCCAACAGA AATTGAGCCA TTTGGTGGAG CGGCTACCTG TATTGGTGGA 10320 

GCTATTCGTG ATCCGTTGTC AGGCCGTTCC TATGTTTACC AAGCCATGCG TATTTCAGGT 10380 

GCTGGTGATA TTACAGCACC GATTTCGGAA ACTCGCGCTG GGAAATTGCC ACAACAAGTC 10440 

ATTTCTAAAA CAGCAGCTCA TGGTTATTCT TCATATGGTA ACCAGATTGG GCTTGCAACA 10500 

ACCTACGTTC GTGAATACTT CCACCCAGGC TTTGTAGCTA AACGTATGGA ACTTGGTGCC 10560 

GTTGTTGGTG CGACTCCCAA GGGCAATGTT GTCCGTGAAA AACCTGAAGC AGGTGATGTG 10620

ATCATCCTTC TCGGAGGCAA AACAGGTCGT GATGGTGTCG GTGGTGCGAC GGGCTCTTCT 10680

AAGGTTCAAA CAGTTGAGTC TGTAGAGACT GCTGGTGCTG AGGTTCAAAA AGGAAATGCC 10740

ATCGAAGAAC GCAAGATTCA GCGCCTCTTC CGTAATGGCA ATGTCACTCG TCTGATCAAG 10800 

AAGTCCAATG ACTTTGGGGC AGGCGGTGTC TGTGTGGCTA TCGGTGAATT GGCAGACGGT 10860

CTTGAAATCG ACCTCAACAA GGTGCCTCTT AAATACCAGG GCTTGAATGG TACAGAAATT 10920

GCCATCTCTG AATCACAAGA ACGGATGGCG GTCGTGGTTC GTCCTGAAGA TGTGGATGCC 10980 

TTCGTTGCCG AATGTAACAA AGAAAATATT GATGCTGTTG TGGTGGCGAC AGTAACTGAA 11040 

AAACCAAATC TTGTCATGCA CTGGAATGGT GAGACAATCG TTGACTTGGA GCGTCGTTTC 11100 

CTTGACACCA ATGGTGTGCG CGTGGTTGTC GATGCCAAAG TTGTGGACAA GGATGTCAAA 11160 

CTCCCAGAAG AGCGTCAAAC ATCTGCTGAA ACACTGGAAT CAGATACCCT TACGGTTCTA 11220

TCTGACCTCA ACCATGCAAG TCAAAAAGGA TTACAGACTA TCTTTGACTG CTCTGTTGGA 11280

CGCTCAACGG TTAATCACCC ACTTGGTGGT CGTTACCAAC TCACACCAAC TGAGGCATCT 11340 

GTGCAGAAAT TGCCAGTTCA ACACGGTGTG ACTCATACTG CGTCGGTCAT TGCTCAAGGT 11400 

TTCAACCCAT ATGTAGCTGA ATGGTCTCCA TACCACGGTG CTGCTTATGC GGTTATCGAA 11460

GCAACTGCTC GTTTGGTGGC TGCTGGTGCC AACTGGTTCA AGGCTCGTTT CTCTTACCAA 11520

GAGTATTTCG AGCGTATGGA TAAACAAGCA GAGCGTTTCG GTCAGCCAGT AGCTGCTCTT 11580 

CTAGGTTCTA TTGAAGCACA AATTCAGCTT GGCTTGCCAT CTATCGGTGG TAAGGACTCC 11640 

ATGTCTGGTA CCTTTGAAGA ATTGACCGTT CCGCCAACCT TGGCTGCCTT TGGGGTGACG 11700 

ACGGCAGATA GCCGTAAGGT GCTCTCTCCA GAATTTAAAG CTGTTGGGGA AAATATCTAC 11760

TACATCCCAG GTCAAGCCCT CTCTGCAGAG ATTGATTTTG ACTTGATTAA GAAAAATTTT 11820 

GCTCAGTTTG AAGCCATCCA AGCTGACCAT AAAGTGACAT CTGCATCAGC TGTCAAATAC 11880 

GGTGGTGTAG TTGAAAGTTT GGCTCTTGCT ACCTTTGGAA ACTATATTGG TGCAGAGGTG 11940 

ACCTTGCCTG AACTTGAAAC AGCTTTGACA GCTCAATTAG GCGGCTTTGT CTTCACATCT 12000 

CCTGAAGAAA TTGCTGGAGT AGAGAAGGTT GGACAAACGA AAGCAGACTT TACACTGACT 12060

GTCAACGGTG TGAAGCTAGA TGGACACAAG CTTGACAGTG CATTTCAAGG GACATTGGAA 12120

GAAGTTTACC CAACAGAATT TACCCAAGCG AAAGAACTAG AAGAAGTACC AGCTGTGGCA 12180 

TCAGATGTTG TGATTAAAGC CAAAGAAAAG GTTGAAAAAC CTGTGGTTTA CATCCCAGTC 12240 

TTTCCAGGAA CCAACTCAGA ATATGATTCA GCTAAGGCCT TCGAAAAAGA AGGTGCAGAG 12300 

GTCAATTTGG TGCCATTCGT GACCTTGAAT GAAGAAGCTA TTGTCAAGTC AGTTGAAACT 12360

ATGGTTGACA ATATCGACAA GACTAATATT CTCTTCTTTG CTGGTGGATT CTCGGCTGCG 12420

GATGAACCAG ATGGTTCAGC TAAGTTTATC GTCAATATCC TGCTTAATGA AAAAGTGCGT 12480

GTGGCTATTG ATAGCTTTAT CGCCCGTGGT GGTTTGATTA TCGGTATTTG TAATGGATTC 12540

CAAGCCTTAG TCAAATCGGG TCTCCTACCC TACGGAAACT TTGAAGCTGC TAACAGTACT 12600

AGCCCAACCC TCTTCTACAA TGATGCCAAC CAACACGTGG CCAAGATGGT GGAAACTCGC 12660

ATTGCCAATA CCAACTCACC ATGGTTGGTT GGTGTGCAAG TGGGCGATAT CCACGCTATT 12720

CCTGTTTCGC ACGGTGAAGG GAAGTTTGTC GTGACGGCTG AGGAATTTGC AGAGCTCCGT 12780

GACAATGGAC AAATTTTCAG CCAATACGTT GACTTTAACG GTAAACCAAG TATGGATTCT 12840 

AAGTACAATC CGAATGGTTC TGTCCATGCC ATCGAAGGAA TTACCAGCAA GAATGGTCAA 12900 

ATCATCGGTA AGATGGGCCA CTCAGAACGT TATGAGGATG GTCTTTTCCA AAATATCCCA 12960

GGCAATAAAG ACCAACACCT GTTCGCATCA GCGGTTAAAC ATTTCACTGG AAAATAAGAC 13020

TTACAGATTT TCTAATAGAT AGTATCAGTA ATGTAAAAGT CATGTAAATC TAGCTCTTGA 13080 

TGATTACAAA TGAAAATTAG GTATAAAAAA TGACATACGA AGTAAAATCT CTTAATGAAG 13140 

AATGTGGTGT TTTCGGTATT TGGGGACATC CAGATGCTGC TAAGTTGACC TATTTTGGAC 13200

TCCACAGTCT TCAACACCGT GGTCAGGAGG GGGCAGGAAT CCTCTCCAAT GATCAAGGAC 13260

AACTGAAGCG CCATCGTGAC ATGGGGCTTT TATCAGAAGT TTTCAGAAAT CCAGCTAATT 13320

TGGATAAATT GACAGGAGCT GGTGCGATTG GGCATGTGCG TTATGCGACT GCTGGCGAAG 13380 

CTTCTGTAGA TAACATCCAG CCCTTCCTCT TCCGTTTTCA CGATATGCAG TTTGGTTTGG 13440

CTCATAATGG AAATCTGACC AATGCAGCCT CTCTCAAGAA AGAACTGGAA CAAAGAGGAG 13500 

CAATTTTCAG CGCGACTTCG GACTCTGAAA TCTTGGCTCA CCTCATTCGT CGCAGTCATA 13560 

ATCCTAGCCT GATGGGCAAA ATCAAGGAAG CGCTCAGCCT TGTCAAAGGT GGTTTTGCCT 13620 

ATATCTTGCT GTTTGAGGAC AAGTTGATTG CGGCTCTTGA CCCAAATGGA TTCCGACCGC 13680 

TTTCGATTGG TAAAATGGCT AATGGAGCAG TTGTTGTATC TTCTGAAACC TGTGCTTTTG 13740 

AGGTCATTGG TGCCGAGTGG ATTCGTGATT TGAAGCCAGG TGAGATTGTG ATCATTGATG 13800 

ACGAGGGCAT TCAGTATGAC AGCTATACAG ATGATACCCA GTTGGCGGTT TGTTCTATGG 13860 

AGTATATCTA CTTTGCTCGC CCTGATTCTA ATATCCACGG TGTCAATGTC CATACGGCAC 13920

GTAAGAGAAT GGGAGCGCAA TTGGCGCGAG AATTTAAGCA TGAGGCAGAT ATTGTAGTTG 13980 

GTGTGCCCAA TTCTTCCCTA AGCGCGGCTA TGGGATTTGC GGAAGAATCA GGCTTACCAA 14040 

ATGAAATGGG TCTGATCAAA AACCAATACA CCCAGCGAAC TTTTATCCAA CCGACTCAAG 14100

AATTGCGGGA GCAAGGAGTG CGGATGAAAC TGTCTGCTGT TTCGGGTGTT GTCAAAGGCA 14160

AACGTGTGGT CATGGTGGAT GATTCCATTG TACGTGGAAC AACCTCTCGT CGTATCGTTC 14220

AGCTCTTGAA AGAAGCGGGT GCGACTGAGG TTCACGTTGC CATTGGAAGT CCTGCACTAG 14280

CGTATCCATG TTTCTACGGG ATTGATATCC AGACCCGTCA GGAGCTGATT GCAGCCAATC 14340

ATACGGTCGA AGAAACTCGC CAAATCATTG GTGCGGACAG TCTGACTTAT CTTTCAATTG 14400

ATGGCTTGAT TGAGTCGATT GGTATCGAAA CAGATGCGCC GAACGGTGGT CTCTGTGTCG 14460

CTTACTTTGA CGGTGACTAC CCAACGCCTC TTTATGACTA CGAAGAAGAC TATCGTAGAA 14520

GTTTGGAAGA AAAGACCAGT TTTTACAAGT AGGCGACAGA TTCTCCATTA AAGAAAAGGA 14580

AAAAATAAAT GACAAATAAA AATGCATATG CCTCACGTCT CACTACTGAC TAAAGGCTTA 14640

AGCATTTAGT CAGTAGACGC TTTGTCCTAT AGGATCAAAG CTAGAGCCCT GACTAGTATT 14700

TTTAGATAAA AAGATGGTTT ATCTAAAAAT ACGTCGCAGT CTTTCTCAAA AAAAGAAAAG 14760

GAAAAATAAA ATGGCAAATA AAAATGCGTA CGCTCAATCT GGTGTGGATG TTGAAGCGGG 14820

TTATGAAGTT GTTGAACGGA TTAAAAAGCA CGTGGCCCGT ACGGAGCGTG CAGGTGTCAT 14880

GGGAGCTCTT GGTGGCTTTG GTGGTATGTT TGACCTTTCC AAGACTGGGG TTAAAGAACC 14940

CGTCTTGATT TCAGGGACTG ACGGTGTCGG AACCAAGCTC ATGTTGGCTA TCAAGTACGA 15000

CAAGCACGAT ACCATCGGGC AGGACTGTGT GGCCATGTGT GTCAACGACA TCATTGCTGC 15060

AGGTGCGGAA CCCCTCTATT TTCTCGACTA CGTAGCGACA GGGAAGAATG AACCAGCTAA 15120

GCTAGAACAA GTGGTTGCTG GTGTGGCAGA AGGTTGTGTG CAGGCTGGTG CTGCCCTCAT 15180

CGGTGGGGAA ACGGCTGAAA TGCCGGGCAT GTACGGCGAA GACGACTATG ACTTGGCTGG 15240

TTTTGCGGTC GGTGTGGCTG AAAAATCTCA AATCATTGAC GGTTCAAAGG TGGTAGAGGG 15300

AGATGTTCTT CTCGGACTTG CTTCAAGTGG GATTCACTCA AATGGTTACT CTTTGGTTCG 15360

TCGTGTCTTT GCGGATTACA CAGGTGAGGA AGTCCTACCA GAATTGGAAG GCAAGAAACT 15420

TAAGGAAGTT CTACTTGAGC CGACTCGTAT CTATGTCAAG GCTGTCTTGC CGCTCATCAA 15480

AGAAGAGTTG GTCAACGGCA TTGCCCACAT CACAGGTGGT GGCTTTATCG AAAATGTCCC 15540

TCGTATGTTT GCAGATGACC TAGCTGCTGA AATTGATGAA AGTAAAGTTC CAGTGCTTCC 15600

AATTTTCAAA ACCCTTGAAA AATACGGTCA GATTAAACAC GAAGAAATGT TTGAAATCTT 15660

CAATATGGGT GTGGGACTTA TGTTGGCGGT CAGCCCTGAA AATGTAGAGC GTGTAAAAGA 15720

ATTGTTGGAT GAAGCAGTCT ATGAAATTGG TCGCATCGTC AAGAAAGAAA ACGAAAGTGT 15780

CATTATCAAA TGAAAAAAAT AGCGGTTTTT GCCTCTGGTA ATGGCTCAAA TTTTCAGGTG 15840

ATTGCCGAAG AATTTCCAGT GGAGTTTGTC TTTTCAGACC ATCGTGATGC CTATGTGCTT 15900

GAGCGTGCAA AGCAGCTCGG CGTTCTGTCC TATGCTTTTG AACTCAAGGA GTTTGAGAGC 15960

AAGGCAGACT ACGAAGCAGC CCTTGTCGAA CTCTTGGAAG AACACCAGAT TGACTTGGTT 16020

TGCCTAGCAG GCTACATGAA AATCGTTGGA CCAACCTTAT TGTCGGCTTA TGAAGGTCGG 16080 

ATTGTCAACA TTCATCCAGC CTACTTGCCA GAATTTCCAG GAGCTCATGG GATTGAGGAT 16140 

GCTTGGAATG CTGGCGTGGG TCAGTCTGGT GTGACCATTC ACTGGGTGGA TTCGGGTGTG 16200 

GATACAGGCC AGGTCATCAA ACAGGTTCGT GTGCCACGAC TAGCTGATGA TACCATTGAC 16260 

AGATTTGAAG CTCGCATCCA TGAAGCAGAG TACAGGCTGT ATCCGGAAGT AGTGAAGGCT 16320 

CTATTTACAG ATTGACTTTT TGATGATTCA TATGATATCT TTGATTTTAA ATTGGAGTCA 16380 

GTGTTTGTTG AAGACGGCTT CAAACGGAGG TATTTGTAAT GTTAGAATCT AAAAAAACAA 16440 

CTCGATATGT ATTTTATGTC TATCTGATGT TATTAACTTG GGGAATCTTA TTTAAGTTTG 16500 

AAACAAATCC TGAATTTATA GCATTTTTCT TAGCTCCAAG GTATATCAAT TGGATTCCAT 16560 

TTTCAGAACC ACTAATAGTC GATGGAAAAA TTGTTTTTGC TGAAATGTTA TTTAATCTGA 16620

TTTTCTTTAT TCCATTAGGT GTTTGTTTCC CTTTGATAAA AACTAATTTA TCTAGTTTAA 16680

GAATAGTCGG GACAGGTTTC TTGATTAGTT TATTGTTTGA GTGCTTACAG TATATTTTAG 16740 

CAATAGGTAT AACAGATATA ACGGATTTGA CTTTAAATAC GCTAGGTGTC TGTGTAGGCT 16800

TACTGATTTA TCAAATTTTT ATAAGAGTGT TCAAATCACA GACTAGAAAA TGGATCAATA 16860 

TCTTAGGTAT GCTTAGCCTT GGTTTTGCTT ATCTTGTTTT ACTGTTACTG CATTTACTTA 16920

GTGTTTAACT AATGATTAAA AAGGAGAATA TAATGACTAA ACGCGTCTTA ATCAGCGTCT 16980 

CAGACAAAGC GGGCATTGTT GAATTTGCCC AAGAACTCAA AAAACTTGGT TGGGAGATTA 17040 

TCTCAACAGG TGGAACTAAG GTTGCCCTTG ATAATGCTGG GGTGGATACC ATTGCTATCG 17100 

ATGATGTGAC TGGTTTCCCA GAAATGATGG ACGGTCGTGT GAAGACCCTC CACCCAAATA 17160 

TCCACGGAGG GCTTCTCGCT CGTCGTGACT TGGATAGCCA CTTGGAAGCG GCTAAGGACA 17220 

ACAAGATTGA GCTCATTGAC CTTGTGGTGG TCAACCTTTA CCCATTTAAG GAAACTATCC 17280

TTAAACCAGA TGTGACTTAT GCTGATGCAG TTGAAAATAT CGATATTGGT GGGCCATCTA 17340

TGCTTCGTTC AGCAGCGAAA AATCATGCCA GTGTTACAGT TGTGGTAGAT CCTGCTGACT 17400

ACGCTGTGGT TTTGGATGAA TTGGCAGCAA ACGGCGAAAC CTCTTATGAA ACTCGCCAAC 17460 

GTTTAGCAGC CAAAGTATTT CGTCACACAG CGGCTTATGA CGCCTTGATT GCAGAATACT 17520 

TCACAGCTCA AGTGGGTGAA AGCAAGCCTG AAAAACTCAC TTTGACTTAT GACCTCAAGC 17580 

AACCAATGCG TTACGGTGAG AATCCTCAAC AAGACGCGGA CTTTTACCAG AAAGCTTTGC 17640 

CTACAGACTA CTCCATTGCT TCAGCCAAAC AGCTCAACGG GAAAGAATTG TCATTTAATA 17700

ATATCCGTGA TGCAGATGCT GCTATCCGTA TCATCCGTGA CTTCAAAGAT AGTCCAACCG 17760

TTGTGGCTCT CAAACACATG AATCCATGTG GAATTGGTCA AGCTGATGAC ATCGAGACTG 17820

CTTGGGACTA CGCTTATGAG TCTGACCCAG TGTCTATCTT TGGTGGGATT GTCGTCCTCA 17880

ACCGTGAGGT GGATGCTGCG ACAGCTGAGA AGATGCACGG CGTTTTCCTC GAAATCATCA 17940 

TTGCACCAAG CTATACGGAT GAAGCGCTAG CCATTTTGAT CAATAAAAAG AAAAACTTGC 18000 

GTATCCTTGC CTTGCCATTT AATGCTCAAG AGGCTAGCGA AGTGGAAGCA GAATACACAG 18060 

GTGTAGTCGG TGGACTTCTC GTGCAAAATC AAGACGTGGT CAAGGAAAGC CCAGCTGACT 18120

GGCAAGTGGT GACTAAACGT CAGCCAACTG AGACAGAAGC GACTGCTCTT GAGTTCGCTT 18180 

GGAAGGCTAT CAAGTACGTC AAATCAAATG GTATTATCGT GACCAACGAC CACATGACAC 18240 

TTGGTGTTGG TCCAGGTCAA ACCAACCGTG TGGCTTCTGT TCGCCTTGCC ATTGACCAAG 18300

CCAAAGATCG TCTGGACGGG GCGGTCCTTG CTTCAGATGC CTTCTTCCCA TTTGCGGATA 18360

ACGTGGAAGA AATCGCCAAA GCAGGAATTA AGGCCATCAT CCAGCCCGGT GGCTCTGTCC 18420

GTGACCAAGA ATCCATCGAA GCAGCGGATA AATACGGCTT GACTATGGTC TTTACAGGTG 18480

TGAGACATTT TAGACATTAA GAAGATAAAA GGGAAGAAAA CAGTTTCTTT CCTTTTTTGG 18540

CTTAAAATAC TAACTGAAAC AAGATTAAAA CGAACTTTTT TGATATAATG TTGGTAAATA 18600 

ATTCGCAAAA GAGGTTGAGG AATGAAACTG CTTGTTGTCG GTTCTGGTGG TCGTGAGCAT 18660

GCGATTGCTA AAAAGTTACT TGAATCAAAA GACGTGGAAA AAGTCTTTGT AGCTCCTGGG 18720 

AATGATGGGA TGACTCTGGA TGGTTTGGAA TTGGTAAATA TCTCTATTTC CGAACATTAT 18780 

AAATTGATTG ACTTCGCAAA GACCAATGAT GTTGCTTGGA CCTTTATCGG TCCAGATGAT 18840 

GCCCTTGCTG CTGGTATCGT GGATGATTTT AACCAAGCTG GACTTAAGGC CTTTGGTCCG 18900 

ACTAGGGCTG CAGCGGAGCT GGAGTGGTCC AAGGATTTCG CCAAGGAAAT CATGGTCAAA 18960 

TACGGCGTTC CGACAGCAAC ATATGGCACA TTTTCAGATT TCGAGGAAGC CAAAGCCTAT 19020

ATCGAAAAGC ATGGTGCGCC TATCGTAGTC AAGGCGGATG GCTTGGCACT TGGGAAGGGT 19080 

GTCGTCGTTG CTGAGACGGT TGAGCAAGCG GTCGAAGCCG CTCATGAGAT GCTTTTGGAC 19140 

AATAAATTTG GTGACTCAGG TGCGCGCGTG GTTATTGAGG AATTCCTTGA AGGAGAGGAA 19200 

TTTTCACTCT TTGCCTTTGT CAATGGTGAT AAGTTCTACA TCATGCCAAC GGCTCAGGAC 19260 

CACAAACGTG CCTATGATGG CGACAAAGGG CCTAACACGG GTGGTATGGG TGCCTATGCG 19320 

CCAGTCCCAC ACTTACCACA GAGTGTAGTT GATACAGCGG TTGACACCAT TGTCAAGCCA 19380 

GTTCTAGAAG GGGTGATTAA AGAAGGTCGC CCTTATCTGG GAGTTCTTTA CGCAGGGCTT 19440 

ATCCTGACAG CTGATGGACC GAAAGTCATT GAGTTCAACG CTCGGTTCGG AGATCCAGAA 19500

ACTCAGATTA TCTTGCCTCG CTTGACCTCT GACTTTGCTC AAAATATCAC AGATATCCTG 19560

GATAGCAAGG AGCCAAATAT CATGTGGACG GACAAGGGTG TGACTCTGGG TGTGGTTGTC 19620

GCATCCAAGG GCTACCCGCT AGACTATGAA AGGGGCGTTG AGTTGCCAGC CAAGACAGAA 19680

GGCGATGTCA TCACCTACTA TGCAGGGGCT AAGTTTGCGG AAAATAGCAG AGCACTGCTC 19740 

TCAAACGGCG GACGAGTTTA TATGCTCGTT ACCACAGCAG ATACCGTCAA AGAAGCCCAA 19800 

GCCAGCATAT ACCAAGAACT ATACCAACAA AAAATAGAAG GACTCTTCTA CCGAACAGAT 19860

ATCGGAAGCA AGGCAATTAA GTAAAGATAT AAGAATAACG CGCCGTAGTC GCCAAACACG 19920 

ATAATGGTCG TCGTGGTGAA AAGACCAGAA CAGTGAATGT TCTGGTCAGG GGGAAACTTG 19980 

GAGACCTTAG GCTCAAAGTT TAGGAATGAA ACCGAAGGTT TGCTTCCGCC TCCATCACCT 20040

AAGACCATTA TCAAAAAGAA AAATAAAAAT TCACAAAATA CGTTAATGAT CGTATGGTTT 20100

GCGAGCGTTA GCGAGCTAAT ATAGAACAAT CACCGCCGTT GTGAAAGAAC GATTGGATGA 20160

TAATCCAATC GTTCAGGGAA ATTGGAAGAC CTTGGGTTTC CAATTTAGGC ATGAGACACC 20220

TTTGGTGGCT GCTGCCGTCC CTCACAAGCT AAGGTGATTG TTGAAAAAGA GGAAAAAGGA 20280 

GAAGAAATGA AACCAGTAAT TTCCATCATC ATGGGCTCAA AATCCGACTG GGCAACCATG 20340 

CAAAAAACAG CAGAAGTCCT AGACCGCTTC GGTGTAGCCT ACGAAAAGAA AGTTGTTTCC 20400

GCACACCGTA CACCAGACCT CATGTTCAAA CATGCAGAAG AAGCCCGTAG TCGTGGCATC 20460

AAGATCATCA TCGCAGGTGC TGGTGGCGCA GCGCATTTGC CAGGCATGGT AGCTGCCAAA 20520

ACAACCCTTC CAGTCATTGG TGTGCCAGTC AAGTCTCGTG CTCTTAGTGG AGTGGATTCA 20580

CTCTATTCTA TCGTTCAGAT GCCGGGTGGG GTGCCTGTTG CGACCATGGC TATCGGTGAA 20640 

GCTGGAGCGA CTAACGCAGC TCTCTTTGCC CTCCGTCTCC TCTCTGTAGA AGATAAGTCC 20700 

ATTGCGGATG CACTTGCCAA CTTTGCTGAA GAACAAGGAA AAATCGCAGA GGAGTCGTCA 20760

AATGAGCTCA TCTAAAACAA TCGGAATTAT CGGTGGCGGT CAACTGGGTC AGATGATGGC 20820

CATTTCTGCT ATCTACATGG GCCACAAGGT TATCGCGCTG GATCCTGCGG CGGATTGCCC 20880

GGTCTCTCGT GTGGCGGAAA TCATTGTGGC ACCTTATAAC GATGTAGACG CCCTCCGTCA 20940

GTTGGCAGAC CGTTGCGATG TCCTCACTTA TGAGTTTGAA AATGTCGACG CTGACGGTTT 21000 

GGATGCCGTT ATCAAGGATG GACAACTCCC TCAAGGAACA GATCTGCTCC GCATTTCGCA 21060 

AAATCGTATT TTTGAAAAGG ACTTTTTGTC AAACAAGGCT CAAGTCACTG TGGCACCCTA 21120 

CAAGGTCGTG ACTTCTAGCC TAGACTTGGC AGATATCGAC TTGTCGAAAA ACTATGTCCT 21180 

CAAGACTGCG ACTGGTGGCT ACGATGGTCA TGGACAAAAG GTTATTCGTT CAGAAGCAGA 21240

CTTGGAAGCA GCCTATGCGC TAGCAGACTC AGCAGACTGC GTCTTGGAAG AATTTGTCAA 21300

CTTTGACCTT GAGATTTCTG TCATCGTGTC AGGAAATGGC AAGGAGGTGA CGTTTTTCCC 21360

AGTTCAGGAA AATATCCACC GCAACAATAT CCTGTCTAAG ACCATCGTAC CAGCCCGCAT 21420

TTCTGAAAGT CTAGTAGACA AGGCTAAAGC TATGGCAGTG CGAATCGCAG AACAACTCAA 21480

CTTGTCTGGA ACTCTCTGTG TGGAAATGTT TGCGACAGCT GATGACATCA TTGTCAATGA 21540

AATCGCCCCA CGACCACATA ACTCTGGGCA CTATTCTATT GAAGCCTGTG ATTTCTCTCA 21600

GTTTGACACC CATATTCTGG GTGTTCTCGG AGCACCATTA CCAGTCATCA AACTCCATGC 21660

GCCAGCCGTT ATGCTTAATG TCCTCGGTCA GCATGTCGAG GCTGCTGAAA AATATGTCAC 21720

AGAAAATCCA AGCGCCCACC TCCACATGTA TGGTAAAATA GAAGCAAAGC ATAATCGTAA 21780

GATGGGACAT GTGACTTTGT TTAGTGATGT GCCGGATAGT GTGGAAGAGT TTGGGGAAGG 21840

GATTGATTTT TAGGACAAGT CTATGATACA AATTATCGTT AATACATTTA TTGAAAAGTA 21900

TAAGACTGGA GCAGTTGTTG AAGTGTTGTA TGCCAGTGCT GACCAAGATA AGGTACAAGC 21960

TAAATATGAA GAACTAGCTG CACAATACCC CGAAAATTAT TTAGCTATCT ATAATGTACC 22020

GCTGGATACG GATTTGAATA CACTAGATCA TTACCCGTCT GTGTTTATTG GAAAAGAGGA 22080

GTTTGAGTAG AAATCTTGGT TTACCTAGAT AGCTTATTCC CAACAGCTTA AGAAGAAAGG 22140

AAAAATTAAC ACATGATCAA CCGTTACTCT CGCCCTGAGA TGGCGAATAT TTGGAGTGAA 22200

GAAAATAAAT ACCGTGCTTG GCTTGAGGTG GAAATCCTCT CTGACGAGGC ATGGGCTGAG 22260

TTGGGGGAAA TCCCTAAGGA AGATGTGGCT TTGATTCGCA AGAAGGCGGA CTTTGACATC 22320

GACCGTATTT TGGAAATTGA GCAGGAGACG CGCCACGATG TGGTGGCTTT CACGCGTGCG 22380

GTTTCTGAGA CTCTTGGTGA AGAGCGCAAG TGGGTTCACT ATGGGTTAAC TTCTACTGAC 22440

GTGGTGGATA CTGCTTATGG TTACCTCTAC AAGCAGGCCA ACGACATCAT CCGTCGTGAC 22500

CTTGAAAACT TCACTAATAT CATCGCTGAC AAGGCCAAGG AGCACAAGTT CACCATCATG 22560

ATGGGGCGTA CTCATGGTGT GCACGCTGAG CCGACAACCT TTGGTCTTAA ATTAGCAACT 22620

TGGTACAGCG AAATGAAACG CAATATCGAG CGCTTCGAGC ATGCGGCTGC TGGTGTAGAA 22680

GCTGGTAAGA TTTCTGGTGC GGTTGGGAAC TTTGCCAATA TCCCACCATT TGTAGAGGAG 22740

TATGTCTGCG ATAAACTTGG CATCCGTGCC CAAGAAATCT CTACACAAGT CCTTCCTCGT 22800

GACCTTCACG CTGAGTACTT TGCGGTTCTT GCCAGCATTG CGACTTCAAT CGAACGTATG 22860

GCGACTGAGA TTCGTGGTCT ACAAAAATCT GAGCAACGCG AAGTAGAAGA GTTCTTTGCT 22920

AAAGGGCAAA AAGGGTCTTC AGCAATGCCT CACAAACGCA ACCCAATCGG TTCTGAAAAT 22980

ATGACTGGTC TGGCGCGTGT CATTCGTGGT CACATGATTA CGGCTTATGA AAACGTCGCT 23040

CTCTGGCATG AACGCGATAT TTCTCACTCA TCAGCTGAGC GTATCATCAC ACCAGATACG 23100

ACCATTTTGA TTGACTACAT GCTCAACCGT TTTGGAAATA TCGTCAAGAA CTTGACAGTC 23160

TTCCCAGAAA ATATGATCCG AAACATGAAC TCGACTTTTG GTCTTATCTT TAGCCAACGG 23220

GCTATGTTGA CATTGATTGA AAAAGGCATG ACCCGTGAGC AAGCCTATGA CTTGGTGCAA 23280

CAAAAACAGC CTACTCTTGG GACAACCAAG TAGACTTTAA ACCACTTCTT GAGGCAGATT 23340

CAGAAGTAAC ATCACGTCTC ACACAAGAAG AAATCGATGA AATCTTCAAC CCAGTTTATT 23400

ACACCAAACG AGTGGATGAT ATCTTTGAAC GTCTTGGACT AGGTGATTAA TTAAAAAATA 23460

AACAGCGAGC TTCAATCTCG CTGTTTATTT TTTATCGAAA AGACTTAGTC TTCTTTTCTT 23520

TTAGTGAGTC CATAGGCTGC TAGTGTGGAC ATGAGTCCTG CGACTACTAG TCCTGCAGAA 23580

TCGTGAGTTC CTGTTTCAGG AAGTTTTTTC TCTGTTACCA CAGGAGCTGG ATCTTGAGGA 23640

AGAACTTTGC TTTCCTCAGC AGGAGCAGTT GATGGAGCTG GTTGGCTTGG GATTTCTAGT 23700

TTTGGTTTTT CTTCAGCAAT AGCGGCTTGT CCGTTTTCAT CGCCTACATG TGTTACCATA 23760

GTTCCGACTT CGACTATTTG AGTAACGGCT TCCTGTGCTA CGACACTATT TACAAGTGTT 23820 

TTCACTTCCT TACCATCGGC AGAAGTGCTC ACAGAGTAGA AGTTGCTACG ATGTCCATTG 23880

ACGCCCTTAG TAATGACTTG TGTTTTTCCT TTGACTAAGA GTGGATTTTC ACAAGTCACT 23940

GTGGTAAATG GAATTTCTTC TTCTTGGATA TCCAGTCTAG GTTTTACCTC AGTAGTTGGT 24000

GCAAGACCAC TTTCATCACC CTTGTGAGTT ACAGGAGCGC CAACTTCAAC CACTTGGTTT 24060

ATAACTTCTT TGGTTACCTG GCTATCAAGG ACTGTTTCTG TTGTTTTTCC ATTTTCAGTG 24120 

AGTACAGAGA TGTAATGAGT TCGTTCACCT TTGACTCCTG CTGTGATAAT ATTTTCCTGA 24180 

CCGGCTGGGA GGTTAGGATT TTCTTTCTTG ATAACTTCAA ATGGAATTTC TTCAGTTCTT 24240 

GTGATGAGTT CTGGTCTGGT TTCAACATTG GCAGCCACTT CATTTTCATC TAGGCTTCCT 24300 

GAATGAGTTA CAGCTGGTTT GAGGCCTTGA AGAGCGGCTT TTAGGTTGGC TACAAGCGTG 24360

TCAAGCTCAG CTTGTTTATT ACGGTTGAGG TTGTAATTTA GAGCTGTTTT AGCTGCGTCA 24420

AGGGCCTCAA GACTTTCTTT ACTATATCCT TCTAAGTTTG TAGGAATTTT AGCTAATTCT 24480

TCGCGGAGAG CATTATAATT AGCACGAAAG TAGTCTTTGT TGTGGTCTGC AAAGGCAGTC 24540

ATGAGTTCAA AGATTTCCTC TTCCTTGTAT TCAGCGCTTG GTCTATCTGC CCAGATTGAA 24600

AGCATACTTC CGACTGTTGG AAGATCTACT TCAGGATATT TGGTAGAAGC TAGTTGATTG 24660 

AATGGTGTTT TTCCAGTATT CTCAATAGCT TTCTTGAGGA AACCACCACC ATCTTCTGGT 24720 

TTTTGACCAA GAATGTAGTA CCAGTCACCG TTGGTATTCA AGAATTTATA GCCTTTGCTT 24780 

GCTAGGTATT GAGGTGATGC GAGGTTATAT CCCCACCAGC CTTTAGACCA GTAAGAAATC 24840

AAGACATCTT TGTCAAACTG AACATCGTCC TTGTCTTCAT AGTAGAAGCC ATCGTTGAAG 24900

GCCATTGGTT GAAGCCCTCT TTCTTTGGCC ATAGCTGCGA GGGTGTTGGC ATATTCGGCA 24960

AATTTGCCAT AGAGTTGATA CCACTTGAGG TAGTACCAGC CTTGGGCACT AGTCGCATCG 25020

TTGGCGTATT CGTCAGTACC AAAGTTGAAA ATCTTTGTTT TACCTGCAAA GAAGTCCATG 25080

TATTTACCGA TGAGGGCTTT TACAAAGTTC ATCGCTTCTT CGTTTTTCAA GTCCATAGTT 25140 

GTTTTTGAAA CTTTATCAAA GTGGGCTTGA GGATTTTTAA TACCTAATTT TTCCATGGCA 25200

ACCAGCATAG CATCCATGTG ACCTGGACTG TTAATAGCTG GGATGAGACC GATGTCCTTA 25260

GATTTAGCGT ATTCAATTAG CTCTGTTACT TCTGCCTGTG TTAGTGCAGT ACCGTTTGGA 25320

TCGTCGTAGT AAGCTTTAGT TCCTTCGATA ATAGCTTTTT TAACGTCATC ACTAGCATAG 25380

GTTTTTCCGT TGGCAGTAAT GGTCATATCA TCGAGTAGAA AGCGAAGTCC GTCATTTCCT 25440

AGAAGGAGAT GGACATCAGA ATATCCGAGC TCACTGGCCT TGTCTACGAT GCGTTTGAGC 25500

TGGTTCAGAG TAAAGTATTT GCGTCCAGCA TCGATTGAGA TTACCTTGTT TTTGGCAAGT 25560

TTTTCAACCT CACGTTTAGC TTCTTCTTCT TTTTGAGCTT CAGGCGTGAG GGTCAAGTTG 25620

TTGACAGTTT CTTGAAGTTT AGCAATGGCT TGATCAATCG TATCTTGTTG GGCACGGCTA 25680

AGGTTGCTAT CGAGAGAGCG AATAGCTTTT TCAGCTTCTT TTACGGCCGT GACGCTTTCT 25740

GCAGTATAAC GGTTCAGGTC TTTTGGTACC TCGTTAAGTG CTTGCTCTGC AGATTCATAA 25800

TCAGCTGCGA AGTATTCAGC GTTGGCATTT GCAAAATGAC GCATGAGTTT GAAGAGGCGT 25860

GATGGTGAAT AACGTGCAGA TGGAGTGTCA GCCCAAGCAG CTACCATACC ACCGATGATT 25920

GGGATATCAG CTCCTTCTGT TTTTGGTACA GAAGTGATTG GTGTGTTTTT AATACCATTG 25980 

AGCCCCTGAT CGAGATTGTA CCAGCCTTGG CCATCAGCGT TTCGTCCAAG AACGTAGTAC 26040

CAAGCATCAT TGGTATTAAG GATTTGGTGA CCTTTTTCAG CTAGTAGTTT AGAAGAAGCG 26100 

ACATCGTAGC CTCCCCAACC ACCAGTCCAC ATAGAAACGA TGATGTCTTT GTCAAAACTA 26160 

CCAAAGCTTG TGTCGCTATT GTAGTAGATA CCGTCGTTAA AAGCCATTGG TTTGAGACCG 26220

TGCGATTTTA CAATACGAGC GAGGTCATTG GCGTAGGCAA TAAATTTTTC ATAGCCTTTT 26280

ACAGGGTAGC CTTCGTTTGG ATAGTATTTA TCAGCTTGAA GCACACTCCA ACCTTTAGCA 26340 

TCTGTCGCAT CATTGGCATA TTCATCAAGT CCGATGTTGA AGATTTCAGT CTTTTTCGCG 26400

AAATAAGCAG CATACTTGTC GATAAGGGCT TTTGTAAAAG CGACAGCTTG TTCGTTGTCA 26460

AGATCGACAG TACGGGCTGA TTTCTTCCCA AAATAGCTAA AGTTAGGGTT TTGGATTCCC 26520

AATTCTTTCA TGGCATTGAG AATCGCATCC ATGTGTCCAG GACTATTTAC TGTCGGAATG 26580

AGACCGATAC CTTTATCTTT GGCATAGTTA ATCAGATCTG TCATTTGACT TTCTGTTAAG 26640

TGATTGCCGT TTGGATCGTT GTAATAATCA TTTGTACCTT TTTCAATGGC GCGTTTGACA 26700

TCGTCACTGG CATAGGTCTT GCCGTTAGCT GTGATGCTCA TATCGTCCAA CATGAAACGG 26760

AGTCCATCAT TTCCGACTAA TAGGTGTAAA TCAGTGTAGC CATAATGTTT CGCTTTATCG 26820

ATGATTTCCT TGAGCTGTTC TGGTGAGAAA TATTTACGTC CAGCATCAAT AGAAACAATT 26880 

TTCTTTTTCG CTAGTTTTTC ATTTACAGTT GCAGCACGTT CCTTTCCTGC CTCTGTTGCC 26940

GGTTTGTCAG CCTCTGCTTT CGCTTCATCT TTTTTAGCTG GTTTATCCTT GTCAGTCTTG 27000

TCTGTATTTG ACTCTTTAGA ATCAACCTCT TTCGCTTCTT CCTTTTTAGG GCTAGCTTCT 27060

TCTGCCTTTT TATTAGCAGT TTCTTTTTCA GCAGAAGTTG GAGTTACCAC TTCTGCTTTA 27120

TCACTAGGAG TTGAACTAAC TTCCTCTTGT GGTTTTTCTT CTGTTTTTGG AAGACTAGCT 27180

ACCTTATCAG TAGCTGGAGT TTCTGTTTCT ACAGTTTTTG GAGCTTCTGG TTGAAGCACT 27240

GCTTTAGGTG TTTCCTCAGT CCGATTTTCG GATGATTGAG GGGAATCAGA AACCGTATGG 27300

ATGGTCGGTT GGTTTTCTGT AGTAGTAGGA GTAACTCCAT CGGCTGCAAC AGTCTGTGCT 27360

TGGAAGGCAA ATCCAATTAG AACAGAAGCT GCTCCTACAG CGTATTTACG AATAGAAAAA 27420 

CGCTGTTGTT TTTCATGTTT CATTGCAAAA CCTCCTGATT GCATTGTTAT ATTGATAGCG 27480

ATTATATAAA TCAACGCCTT TATTTTATTT CTTATATTAA TTTCTTATAT TAACGAGAGT 27540

CAAGAGGAGA TGACAAAAAA CTATAATAAG TATAAAAAAA TATAAAATTT AAACTTAAGA 27600

TTTCAGATTG GTCGGAAAAA ATACGTATAT ATATCTAGTA TAATTTTTGG TTCTATTTCT 27660

ATAAAATATT CCACAAATTA TAGAATTTTC CAAAAATAGG TAAGCGCTAC CTTTTTGGTG 27720 

TAGTATAATA AGCATAGAAA AAGCCCAAGC GATTAGCTCA GGTTTTCTTC TTAGTGATCA 27780 

CGGTCACATG AGATAAATTT AATCTTGTAG TAATCAGATC GTTTGTAAGT TTCACTGTAT 27840

TCTAAAACTT GGCCAGTTGA TTCGAGTTTG GTGATTTTAG TTTGTAGGAC AGTAGGGAAT 27900

TGTTCATCGA CTCCGAGGAC TGAAGCTGCA TGTTCTGGAG TTGGAAAGAC TATTTCGTTG 27960

ATTTCTTCAA AGTGTTCATC ATTCATGTGA ATGTGGTAGT CTAACTTGAA ACGATTATAG 28020

ATAGAACTAT AGTATTCAAG GTTTGGATAA TTTGCGTTGA TATATTGTTC TGGGATGTAG 28080 

GATGTATGGT AGATATAAAC GACACCGTTT GATTCGCGGA TACGTTCAAT CTTGTAGTAG 28140 

AATTGATCGC CGCGTAGACC CAATTTTTCC AAGTAAACAA GCTTGTTTCC GCGTTCAATT 28200 

GAAAGAACAG TTACCTTATC ATCTTTAGCA TTGAAGAGTT CAATATCTGA AAACTCTACA 28260 

AGCTTGTGTT TGCGTGCACG TGAAACGAAG GTTCCTTTTC CTTGTTGGCG GACAATATAG 28320

CCATCTTTGG CAAGGTCGTT TAAGGCGCGA ACAACTGTGA TAGAGCTGAC ATCGTACATT 28380

GAAATGAGTT CTGCTTCAGT GTAAAATTTA TCTCCACTGC TAAACTGCCC AGAGATGATT 28440

TTATTTTTTA ATTCGTCTTT TATGTATTGA TGG 28473
(2) INFORMATION FOR SEQ ID NO: 84:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6749 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

CCTGATGGGT GGTATGCGAG GATACAGTTC TGAAAATCGC CGTTACTTAA TTAATGGACG 60 

CGAAGTCACA CCTGAGGAAT TTGCTCACTA TCGTGCGACT GGTCAATTAC CAGGAAATGC 120

AGAAACTGAT GTGCAAATGC CACAACAGGC ATCAGGTATG AAACAAGGCG GTGTCCTTGC 180

AAAACTAGGT CGAAACTTAA CAGCAGAAGC GCGTGAGGGC AAGTTGGATC CTGTTATCGG 240

ACGAAACAAG GAAATTCAAG AAACATCTGA AATCCTCTCA CGCCGCACCA AGAACAATCC 300

TGTTTTGGTC GGAGATGCAG GTGTTGGTAA GACAGCAGTT GTCGAAGGTC TAGCGCAAGC 360

CATTGTGAAC GGAGATGTTC CTGCTGCTAT CAAGAACAAG GAAATTATTT CTATTGATAT 420

CTCAGGTCTT GAGGCTGGTA CTCAATACCG TGGTAGCTTT GAAGAAAATG TCCAAAACTT 480 

AGTCAATGAA GTGAAAGAAG CAGGGAATAT TATCCTCTTC TTTGATGAAA TTCACCAAAT 540 

TCTTGGTGCT GGTAGCACTG GTGGAGACAG TGGTTCTAAA GGACTTGCGG ATATTCTCAA 600 

GCCAGCTCTC TCTCGTGGAG AATTGACAGT GATTGGGGCA ACAACTCAAG ACGAATACCG 660 

TAACACCATC TTGAAGAATG CTGCTCTTGC TCGTCGTTTC AACGAAGTGA AGGTCAATGC 720

TCCTTCGGCA GAGAATACTT TTAAAATTCT TCAAGGAATT CGTGACCTCT ATCAACAACA 780

CCACAATGTC ATCTTGCCAG ACGAAGTCTT GAAAGCAGCG GTGGATTATT CTGTTCAATA 840

CATTCCTCAA CGTAGCTTGC CAGATAAGGC TATTGACCTT GTCGATGTAA CGGCTGCTCA 900

CTTGGCGGCT CAACATCCAG TAACAGATGT GCATGCTGTT GAACGAGAAA TCGAAACGGA 960

AAAAGACAAG CAAGAAAAAG CAGTTGAAGC AGAAGATTTT GAAGCAGCTC TAAACTATAA 1020 

AACACGCATT GCAGAATTGG AAAGGAAAAT CGAAAACCAC ACAGAAGATA TGAAAGTGAC 1080 

TGCAAGTGTC AACGATGTGG CTGAATCTGT GGAACGAATG ACAGGTATCC CAGTATCGCA 1140 

AATGGAAGCT TCAGATATCG AACGTTTGAA AGATATGGCT CATCGCTTGC AAGACAAGGT 1200 

GATTGGTCAA GATAAGGCCG TAGAAGTTGT AGCTCGTGCT ATCCGTCGTA ACCGTGCTGG 1260

TTTTGATGAA GGAAATCGCC CAATCGGCAA CTTCCTCTTT GTAGGGTCTA CTGGGGTTGG 1320

TAAGACGGAG CTTGCTAAGC AATTGGCACT CGATATGTTT GGAACCCAGG ATGCGATTAT 1380

CCGTTTAGAT ATGTCTGAAT ACAGTGACCG CACAGCTGTT TCTAAGCTAA TTGGTACAAC 1440

AGCAGGCTAT GTGGGTTATG ATGACAATAG CAATACCTTA ACAGAACGTG TTCGTCGCAA 1500

TCCATACTCT ATCATTCTCT TGGATGAAAT TGAAAAGGCT GACCCTCAAG TTATTACCCT 1560

TCTCCTCCAA GTTCTAGATG ATGGTCGTTT GACAGATGGT CAAGGAAATA CAGTAAACTT 1620 

CAAGAACACT GTCATTATTG CGACCTCAAA TGCTGGATTT GGCTATGAAG CCAACTTGAC 1680 

AGAAGATGCG GATAAACCAG AATTGATGGA CCGTTTGAAA CCCTTCTTCC GTCCAGAATT 1740 

CCTCAACCGC TTTAATGCAG TCATCGAGTT CTCACACTTG ACTAAGGAAG ACCTTTCTAA 1800

GATTGTAGAT TTGATGTTGG CTGAAGTTAA CCAAACCTTG GCTAAGAAAG ACATTGACTT 1860

GGTAGTCAGT CAAGCGGCTA AAGATTATAT CACAGAAGAA GGTTACGACG AAGTCATGGG 1920

GGTTCGTCCT CTCCGTCGCG TGGTTGAACA AGAAATTCGT GATAAGGTGA CAGACTTCCA 1980

CTTGGATCAT TTAGATGCTA AACATCTGGA AGCAGATATG GAAGATGGCG TTTTGGTTAT 2040

TCGTGAGAAA GTCTAAGACA GAATTTTGAG GATAAAAAAG AAGGAGCCAG CTGAAAAAAA 2100 

CTGGTTCCTT TTTAGGTACG ACAGGCATGT CGTATAGTAG AAGTGTATTA TTCTAGTTTC 2160 

AATATACTAT AGTAGCTCAG AAGTCGGTAC TTAAACGTGC TATATCAAAA CCAGTCCTGG 2220 

AAAAACGTGG ACTGGTTTCG TGTTTGGATT ATTACCTTGA ACGACATGCG TTAAAAGTTA 2280

GTTGAACCGC CGTATGCCGA ATGGTACGTA CGGTGGTGTG AGAGGGGCTA GAGATTATCC 2340

CCTACTCGAT TTTAAATCAC ATGACGTTCA AAGGCATCAT CTGAAATCCC TTGTTCCAAG 2400 

ATGAGTTTTG CCCATTCTTT AGCAGAGAAG AGGCTGTGGT CCTTGTAGTT TCCGCAAGAT 2460

TCGATGGTTG TCCCTGGGAC ATCTTCCCAA GTAGTAGTTT CAGCGATTTC CTTGAGCGAA 2520

TCCTTGATAA CAGCTGCGAT TTTAGCACTG GTGTGACGTC CCCACATAAT CATGTGGAAG 2580

CCTGTGCGGC AACCAAATGG TGAACAGTCA ATCATGCCGT CAATGCGGGT ACGGATGAGT 2640

TTGGCTAAGA GGTGCTCGAT AGTGTGAAGG CCGGCAGTAG GGATAGAGTC TTCGTTTGGT 2700

TGCACCAAGC GAATATCATA ATTGGAGATG ATGTCTCCTT TTGGTCCTGT TTCTTCCCCA 2760

ATCAAGCGAA CATAGGGTGC TTTGACAATG GTGTGGTCAA GTTCAAAACT TTCGACAATA 2820 

ACTTCTTTTG ACATGGTAAA TCCTTTCAGT TTTCTTCTCT CATTATATCA TAAAGGTTGC 2880 

TCCTGAGACA GAGAGAAAAC CTCTCCGAGG CTGGAGAGGT TGAAATCTTT ACTTACGATA 2940

TAAGCGGTCG TATTGGTAGT ATGGGTCAAA GGTTACGTTG ATACCCAGTT TACGAAGGAC 3000

ATTCTTGTCT TCATCAGTCA AGATGATGGT TGAGTGGGCT TCGCTTCCTT TGAGGTTGCC 3060

GAGTTCTTCC ATAGCGCGGG CAGCATCAGG ATTTTCTGTA GCTGTGATAG CAAGTGCAAT 3120

CAGGATTTCA TTTGAATGAA GGCGTGGATT GCGGCTACCG AGATGATCGA TTTTAAGACC 3180

TTGGATTGGC TTAACAACTT CAGGCTCGAT TAGTTTTACT TCTTTAGCGA TGTCAGCTGA 3240

TTTTTTGATG GCGTTGATCA AGGCAGCGGC TGTAGGACCA AAGAGTTCTG AGTTCTTACC 3300

AGTGATGATT TCCCCATTTG GCAATTCAAA GGCTAGGGCT GGTCCACCAG TTTCTTCTGC 3360

TTTTTGGCGC GCAACGACAG CAACCTTACG GTCTGCAGGT GTGATACCGA GGTCGTTCAT 3420

GAGCAACTCA ATTTTCTTGA CGGCAGCTTC GCCAACTTTT TCAGCTTTGA AGTCAAGAAC 3480 

TGTTTGATAG TAACGGCGGA TGATTTCTTG TTTAGAAGCT TCGACAGCGG CCTCGTCATC 3540 

TGTAATAGCG AAACCAACCA TGTTGACACC CATATCTGTC GGTGAAGCGT ATGGTGATTT 3600 

TCCGAGAATA CGTTCCAACA TGCGTTTGAG CACTGGGAAG ATTTCGATAT CACGGTTGTA 3660

GTTGACAGTG GTTTCTCCAT AGGTTTGAAG ATGGAAGGGG TCAATCATGT TGACATCATC 3720

AAGGTCAGCT GTGGCAGCTT CATAAGCCAA GTTAACTGGA TGATGAAGGG GAAGATTCCA 3780

AACAGGGAAG GTTTCAAATT TAGCGTAGCC AGATTTGATG CCATTGATTT GGTCGTGGTA 3840

CATATTGGAC ATACACGTTG CCAATTTTCC AGAACCAGGT CCAGGAGCGG TTACGACAAT 3900

CAAGTTGCGA CTGGTTTTGA TGTAGTCGTT TTTGCCCATG CCTTCTGGGG AAATGATGTG 3960

ATCCATATCC GTCGGATATC CTTTGATTGG ATAATGAAGA TAAGAATCAA TTCCGTTTTT 4020 

CTCAAGTTGA TTGCGGAAGG CATCTGCAGC GGGTTGGCCA GCGTATTGTG TAATGACAAC 4080 

GGAACCAACA AAAATCCCTA ATTCATTGAA TTTATCAATC AAACGAAGAA CTTCTTGGTC 4140 

ATAAGAAATG CCTAAGTCGC CACGTGCTTT GGAATGTTCA ATGTTGCTAG CATTAATGGC 4200 

AATCACAACC TCAACCTGCT CTTTCAATTC TTGCAAGAGC TTGATTTTGT TGTCAGGTTC 4260

ATAACCAGGA AGGACACGAG CAGCGTGGAA ATCTTCTAAC ATTTTACCGC CAAACTCTAA 4320

GTAGAGCTTG CCGTCAAATT GGTTAATGCG CTCCAAAATA TGGTCGCGTT GTAAATTCAA 4380

ATATTGTTCA GAACTAAAAG CTTGTTTTTT CATTTTTTTA CCTCTGGACT CTATTATAAT 4440

AAAAAATTGG AAGTTAGGAA ACTACGGAGC TAAAAAAGAA ATTAAAAAGA TTAAGCAAAC 4500

GCTTGCACAA AATTTTAAAA AGTGCTATCA TAGACTATAG ATTATGAAAA TAATGAGGTA 4560 

AACAGATGCA AGAAAAATGG TGGCACAATG CCGTAGTCTA TCAAGTCTAT CCAAAGAGTT 4620 

TTATGGATAG TAATGGAGAT GGAGTTGGTG ATTTGCCAGG TATTACCAGT AAGTTGGACT 4680 

ATCTAGCTAA GCTAGGAATC ACAGCAATTT GGCTTTCTCC CGTTTATGAC AGCCCTATGG 4740 

ATGATAATGG CTATGATATT GCTGATTATC AAGCGATTGC GGCTATTTTT GGAACCATGG 4800

AGGACATGGA TCAGCTGATT GCAGAAGCTA AGAAGCGTGA CATTCGTATC ATCATGGACT 4860

TGGTGGTCAA TCATACCTCA GATGAACATG CTTGGTTTGT CGAAGCCTGT GAAAATACTG 4920

ACAGCCCTGA GCGAGACTAC TATATCTGGC GCGATGAACC CAATGACCTA GATTCTATCT 4980

TTAGTGGGTC TGCTTGGGAA TACGATGAAA AGTCAGGTCA ATACTATCTC CACTTTTTCA 5040

GCAAGAAACA GCCGGATCTC AACTGGGAAA ATGAAAAACT TCGCCAGAAA ATTTATGAGA 5100 

TGATGAACTT CTGGATTGAT AAAGGTATTG GTGGTTTCCG TATGGATGTT ATTGACATGA 5160 

TTGGCAAAAT TCCTGACGAG AAGGTAGTCA ATAATGGTCC TATGCTCCAT CCCTATCTCA 5220 

AGGAAATGAA TCAGGCGACC TTTGGAGATA AGGATCTCTT GACAGTAGGG GAGACTTGGG 5280

GAGCAACTCC AGAGATTGCC AAGTTCTACT CTGATCCAAA GGGGCAAGAA TTGTCTATGG 5340 

TCTTCCAGTT TGAACATATC GGTCTTCAGT ATCAGGAAGG TCAGCCTAAA TGGCACTATC 5400 

AAAAAGAGCT GAATATCGCT AAGTTAAAAG AAATCTTCAA CAAATGGCAG ACAGAGTTAG 5460 

GAGTTGAGGA CGGCTGGAAT TCCCTCTTCT GGAACAACCA TGACCTCCCT CGTATTGTCT 5520 

CAATCTGGGG AAATGACCAA GAATACCGCG AAAAATCTGC CAAAGCCTTT GCAATCTTAC 5580 

TTCATCTCAT GAGAGGAACT CCTTATATCT ACCAAGGTGA GGAGATTGGG ATGACCAACT 5640

ATCCGTTTGA AACACTGGAT CAAGTAGAAG ATATTGAATC TCTCAACTAT GCGCGTGAGG 5700

CTCTTGAAAA AGGTGTTCCG ATTGAAGAAA TCATGGACAG TATCCGTGTT ATTGGACGTG 5760

ACAATGCCCG TACCCCTATG CAATGGGACG AGAGCAAAAA CGCTGGTTTC TCAACAGGTC 5820

AACCTTGGTT GGCGGTTAAT CCAAATTACG AGATGATCAA TGTCCAAGAA GCGCTGGCAA 5880

ATCCAGATTC TATTTTCTAT ACCTATCAGA AACTGGTCCA AATTCGCAAG GAGAATAGCT 5940 

GGCTAGTTCG AGCTGACTTT GAATTGCTTG ATACGGCTGA TAAGGTCTTT GCTTATATAC 6000

GTAAGGATGG CGACCGTCGC TTCCTAGTTG TGGCTAACTT GTCCAATGAA GAGCAAGACT 6060

TGACAGTAGA AGGAAAAGTC AAATCTGTCT TGATTGAAAA CACTGCGGCT AAAGAAGTAC 6120

TTGAAAAACA GGTCTTGGCT CCATGGGATG CTTTCTGTGT GGAATTACTA TAAATATTTT 6180

TTGCAGAAAA ATTTAAAATT GAAATCGTAT AAAAACAAGG GAGGACTGTA TAAAAGACAG 6240

AAATCCTTTG TTTTTTATAA CCAAAGTTTA TAAACTTTCA TTCTTGAAAT TCAATTAACT 6300

TTACAAATTC CCACTATTAA GGAGAAAGAA GATGAACATA AAGAAGCGTG TCCTTAGTGC 6360

AGGCCTGACT TTTGCATCTG CTTTGCTTTT ACCCAAATCA TTCATACCTC TCTCAACTAG 6420 

ATGTAACTTA CAAAACCCCT GACCTCATGA GCCACTTTCT TCCTCCTCAT GAGGTCAGTT 6480 

TTACTTTCTG CTGTTCCAGT ATCGTTTTTC CTCGCTAGAT TTCCTCAAAA GGGCAGACTC 6540 

CTCCCTTGGT GCGTCACACG ATTTTTTCAT CTCGACTGTT CTTTAATGCA TCATTAACGA 6600

CGCTTTTCTT CTAGGTGGTT CATAAGGAAC AGGAAGATTC AGGTTGACTT TTCTAATCCT 6660

AGAATAAAGT GCTGAAAACA ATTCGGAATA GGCATAGAGA CTAGACAATT TGAGGAGCTG 6720

CTTGCGTCCT GTTCGAACAC ATTTTCCGG 6749
(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1842 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

TCTACCCATG GACTTTGAGG CATTCATTGT TCCATCTTCT AGTGGCGAAT CTTTTGATAC 60 

AAACGATTCA ATTCACTTGG ATAGTGAAAC TCTCCCGCAA ACATTTTTCT GGTTAACTCA 120

ATCCAGCTGA TATTTCTTTC AGCCAAAATA ATGGACAAGT TCTCCCAAAA TCGTTCAGCC 180 

ATATTGCTTC TCCTTTAGTT AGATAAATAA TGTGTTTGCG CCATGTAAAT CAATTGTTTC 240 

GTATCTCTTG GCAATAGAGC TCTAGCCTCT TCCAAATTCA GACTTGGATA AACTCGCTTA 300

TTTGAAACCG CAAGAGGAAG TCTGATGGTT AGTTCAGGAT TTTTTAAAAT TATCTCAACG 360

AAATCCGTTA ATCTTAGATT GTCACGGTTC TTAAATCGTA ATAAATTGGG AGATAAAAAC 420

TCAAAACAAT CTGAAGAATA GCTCATCATC TCAATTAATT TGTCCTTTGT CATTTCAGAA 480

ACTGAATGAC AAGATACCTC TATGCCATAG TTTTGGAAGA AATCTAAAAG AAGTTGATTT 540 

CTTTGTCTAT TTTTACTTAG ATAGAGATCA ATCATGGGAG ACCTCCCAAA GATTCGGTTC 600 

CATTTGATAT TCTGACACGA TTAAGGAATC TAATAAATTA AGGAATCTAA TAAATTTGCG 660 

AAGTTAATCG GTTTCTTGTC TTCATCATAA GCTTTTACAG TTACTTGGGT TGTAAGTATT 720 

CCCTCTTTTC CCTCGGCTCG ATAGCCTTGT CCATATAAAA CAAAAACGAG ATTTTGATGA 780

TCATCTACAA AGGCATCAAC CCCATTCTTT ATGTCTTGAC TTTCAAGGAA TTCCATAACG 840

TTTTGAAGAT AGGATTCGTA AAATAGTGGG TACTTATGTT TTTTATGGTA ATCATCTAAA 900

AATGTCACTT CAAACTCACA TGGAGAGTAA TTTTGACTTT GAACAGCCTA AAAGTGCCAT 960

CAAATTTGAA TTGGAATAAA TCAAATAAAT AGCCCCATCC TCATCAATCC AACCTTTGCT 1020

CAAAGACAAC TCCAACCGAT CTTTTAAAAC TGAGTAAACC ACCTTAACCT CCAGTTTCAT 1080 

ATTCTTATAC CGTTCACTCT CAAATAAAAG TTTGGGGAGC TTATAATAAC GCTCTGATGT 1140 

CTGATATTGA TTAGCGGTAA TACGCTTCAT TATTGTCCCT CCAAGACTAA AATTCCAACA 1200

TTTCCAAATT CATCAAATCG GATTAAACCT ACTTGTTCCA TTTCATCAAC TAACTGAGTT 1260

GCTTTTACCC AAATCATTCA TACCTCTCTC AACTAGATGT AACTTACAAA ACCCCTGACC 1320

TCATGAGCCA CTTTCTTCCT CCTCATGAGG TCAGTTTTAC TTTCTGCTGT TCCAGTATCG 1380

TTTTTCCTCG CTAGATTTCC TCAAAAGGGC AGACTCCTCC CTTGGTGCGT CACACGATTT 1440 

TTTCATCTCG ACTGTTCTTT AATGCATCAT TAACGACGCT TTTCTTCTAG GTGGTTCATA 1500 

AGGAACAGGA AGATTCAGGT TGACTTTTCT AATCCTAGAA TAAAGTGCTG AAAACAATTC 1560 

GGAATAGGCA TAGAGACTAG ACAATTTGAG GAGCTGCTTG CGTCCTGTTC GAACACATTT 1620 

TCCCACCACG TGAAGAAAAA GATGGCGGAA GCGTTTGATT GTTAAAGTTT GGAAGTCACC 1680 

TCCAGCTAGA TGTTTGAGAA AAAGATAGAG ATTGTAGGCG ATACAGCTCA TCATCATACG 1740

AACTTCGTTT TTGATTAAGG TTGAACTATC CGTTTTATCG CCAAAAAATC CCTCCTTCAT 1800

CTCCTTGATG AAATTCTCGG CTTGACCACG TCCACGATAA AG 1842
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19390 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

TCATCTTTAT CTCCTCGAAA TTTTCTAATA TAGCCATTAT AACAGAATTT TGTGAAAATT 60 

CCTATTATAG TAAATCACTA TTTCAGTATA AAAAGAAAAA ACGAATCAGA CGATTCGCTC 120 

TTCTTAAAAT CTGAAAATAG CTTTCCAGAA AGGATTAGCC GATTTTTTGC AGATTGAGCA 180 

CTGCATCGTG ACTCATCAAG ACTTGACCAT ACTCTTGTAA GACTGAGCGA CTGATATCAC 240 

TATCGTCTGC AAACTCGCGC ATACGGGCCA ACAGCCAAGC TGGATATGGG CTTGGATGAT 300 

TTTCAATATC CACTAAAATG GTCAAATAAT AGCGCTCGTT CATTTTGTAG AGTTCAGAAG 360

TTTCCATTTC AAAAGTCACT GTCTTGGCAA AAGCTACCAA GTCAGCCAAC TTAGCAAAAG 420

AAAGGATGTA GTAGATGTAA GGTTCTTTCT TACTCTCAGC TTCTTGTTCA GCCTGCTCTT 480

GCTCTTCTTC CTTGACTTCA ACTTGCTCAA GAGATTGAAT GGCTTCGATA TCATCCTTGG 540

TTTTGTCTGC GATGCTTTTT TCCAGGGTTT TGATAAATTC ATCTGGAGAC ATTTGAGCCA 600 

ATTCTTCCAT ATCTGGCAAA TCCGATAAGT CTTCAAAATC TAGATTTTGG TCAATCTTTG 660 

ACTTGGTCAC AAAGACATCT ACCTTATCAG GTTTTGGAGT CACACGGAAG CTCAACATGC 720

CTGTATCCAG AAAGCTATCA GGCATCTCTA GCTCATCCAA GATAGCATAA AAGAACTCTT 780

CTGTTTTTTC TTGAGGAACG AGAAAGTCAG CAATCTCCAT TCCACGATCC ATCAAATCCT 840

CTAAAGATAT CGTGATTTTT AAAGTTGTAT CACTAATTTG TTTCATTTTC ATTGCTAGTA 900 

ACCTCATACT TTCAGTTCTA TCTATTATAC TAGATTTTTA CGATTTTATC AAAAGAAGGC 960 

TCCTCTATAC GGATAGATTT TCCCTAGGGT CTTTCTATAG GAGACTCCAA AAGAAAATTT 1020 

CTGCAGACAG ATAGAAAAAG CCTTCAAAAT CGGCTAAGAG CCGACTTTGA AGACCTTATA 1080 

CATCAGAATA CTTATAATTT AAAGGTTGCT ACACCGAGGA TAGAACGATT TAAGTTTCTG 1140 

AGAATTTGAA GACTTTGCTC AAATTTCTTA TAACGAGTCA CTCCGTACTC TTCAACAAGA 1200 

AGGACTGTAT CTCTTTCCAA AAGAGATGAT ACATCCTGTA AATCTACAAA ATGCATTCCT 1260

TTTAAAGCTT CTTGACTCTG TTTCAATTTA TCTAAGATAG CTTTATTTGA GCTAACGATG 1320

GTCAATTCCT GTCCAGTATT TTTGTATGAC AAAACATCTG CTAGGTTAGC AATTGTTGTA 1380

ATCTCTGTTA CAAAATCAAT TTGATACTGA GAAAAATCAC CTACTCTATT GATTGTTGGA 1440 

TTAAAGAGAT AAACTAACAC ATTTCCCATC ACAACCAAAA TCACACAAAC CACTCCAATA 1500 

ACAACTAAAC GAAGAATCAG ATTTTTCACA TTTAAGCCAA GCGCTGTTTC ACCATTTGCG 1560

TTCAATTCTT TAGAGTTGAT GGTTTCCAGT TTTTCAATTT TCACATTTGC ATAGGCATGT 1620

TTAAATTTCT CAATCAACCC ATCAATTTTT TTCTCTAACA AGTTATTGGC ATCTTTACTT 1680 

GATGTCAAAA TTTTCACACC AACCCCTGCA TCGTCAATCA TATAGTAGAC GGTCAATTTT 1740 

TTCCACCAAT AGTCATTCGT TGAATTTTTC AAGGTTGTTT CTGTCGTGTC TAATTCACTG 1800 

GCAATTTTTT TCAACTCACT GGGTTCTACA TCATTGAAAA GATAAGCTCC ATTCAAATTA 1860 

CCATCAATCA ATTTCCCATA AAAATCACTA TAACCACCAA TTTGATGATT CAAAATCGTT 1920 

TTGTCCGACT CTTTTGGAGG AGTGATTTTA TAGATAAGAT AAGTTGAATA ACTTGTTGTA 1980

TCTTTGACAG TGTTTTTATT CCTAACTGCT TTAATTGTAA ATGGTACAGC AATGAGAGCA 2040

AATAAAGCGA TGAGAGCTAA AATATTTGCT TTTCGCTTTT TATAAAGATT TGCAAACAAA 2100 

TCAGCTACTG AATAATGTTC AAACATGATT TTTTTCTCCT TTGTTTAGTA GATACTAGTT 2160 

TTCCTTTGTA AGCATTTTTG CTACAAATAT AATCACAAGA ACAATTCCCC AGAATTGCAT 2220

TGTAAATAAA TTGAAGAAAC TTTCTGAAAA GCTGCTTCTT GGCATAAAGA ATAGATTATT 2280

CAAGATGAGT AGGGATAAAG CAAATAGGAT TGTCCTTGAG CGATAGGCTA CTTGCAGCAT 2340 

GGCTATAAAT AATACGCCGA GTAAGAAACT AAGCAGAAAG ACTCCAATCA TACCATAGTC 2400 

GGTATACAAC TCCATGATAT AACTACTTCC GATACCATGC CCTTTCAAGT ATTCCTTGTT 2460 

CAAGACAAGA TAGGATAGAT TGTGGGCATA ACTATTACTA TCAATAGCTA GTTCCACACT 2520

ATTGGTTGTA TGTTCAAAGG CTTTTCCTCC GAAAATGGCT CCCAAACTCC CCCTTGCAAA 2580

ATAATCAAGA ACAGGACCAA AAGTAAAATT ACGGAAATCT CGGTAAGGGA GGCTACTGTT 2640

AAATAGAAAA CCTCGAGCCA GAACACCAAA ACTAGTCCCT TGTTTATAGA TAAAGTCAAG 2700

TAAGATATCC CAGAAACCTG TATGGGAAAC TTGGACATTA TCCCGTACAT AATTGAGTAC 2760

TCCCATCGCT AACATGAGAA TAGGAGAACC TACAAAAATC GCTAACTTTT CTTTAAACCC 2820

AATCCATTTT CCTTTTTCAG TTTGCTCCCG CATAAAGTAA TAAACAAAAG CAAATmAAAT 2880

ACTTAAAATA AAGGGATTTC GTGTCCCAAT TGCCAAATGA ATACTATTAG CTGCAATAAA 2940

GGAGACAAGC ACTGCTGTGG CCTGCAATTT CTTTGGCTTG GTTGCCAGAT ACATACACAT 3000

TGCATAGACC GTAAAGGTAG ACAAAATGTA GGTAAAATAA GGCAGTTTAC TTTCAAAATT 3060

TGCATAGTAG GCATAGTAGG AAGTCTGCAA ACGATACAAG AGCCGTTCAA ATAACCGAAT 3120

GAAATAGAAA GGATAAGTTA GAAGAAAAAC TCCTAGTGAT ACAAAGCGTA ACCGCTTGAT 3180 

ATAAACCTCT TTTAGAGAAT TTCCTATATT TGCTACTTTT ATTTTCTTCC TAGCTATGAA 3240 

GTAACGAGCC AGAATGCCTC CTGTGGTCAA GCCCAGAATC GAAATCATGA CAACTATAAA 3300

GGCAAAACGA TAGGCTATTG GATGATAGGT ATCCAAAGCA CCATCCCTAA AATAATCAAT 3360

GGTCGGTCTT GATACCAGAA ATACAAAAAT GGTTAAATAG AAAATAAAAT GGATTAAGTA 3420

ATACTTGATA TCATTCCAAC AAGCAATTAA GCTACTAACC AACAAGAACA ATAAAGTAGA 3480

AAGTAAGCTA ACATTATTAT TATTAAACAG ATACACAATT CCACTTACTA GCGTCAAGGC 3540 

ATAACTGACT ATGGTCAAAC TAAATAATAA TCGTTTCCCA TCAATCACTT GGTCACCCCC 3600 

GTTCTAATGT AATTTTTTAG ATTTTTCAAT ATTTTTCAGT AATAAGAATC GATATAAGGA 3660 

AATATTTATG AATAGGGCCA AAGCACTAAT TCTTCTCCCC TTACGGAAAA TTGGATTCCT 3720

AGAAATAGCA AAGGCATGGC CTTTTAAAAA ACGATGAATC TGAGAATAGG CTTCAAACTG 3780

TTTATACTGA TCATCTAGCA ACATCTTATC CAGAATAAAG AAGTGGGCAT AGGCCAATCT 3840

GAAAAAAGCG ACCTCTTTCA AGTCAGGATA GTTTTTCACA ACTTCATTAT AAAACTTTTG 3900

GTAGATATCA ATATAGGCTA AATCCTTCTC TGCATAGGGT TTGGTCGTAA TACTATCCCC 3960

TCTATGGAAA TAGTAATAAT AGGGTTTAGT ATTAACCACA TACTTCTTGG CCAACTTGAT 4020

TAAATCAAAA TGGTAATAGG CATCTTCGTA AATCAACCCC TTAGGAAAGG ATAGGGCAGT 4080 

TGCAATCTGT CTCTTGATTA GCTTATTGCA AATCGTCCCA GGTATTTTTT CACCTATGAG 4140 

GTATTCCTTT AGAAATGTTT GAGAATCACA GACAAAATAG TCATCCTGAT TGGCTGACTG 4200

TGGGCTTTCA TCATTAGCAT AGACATTCAT GACACCACAG CTCGAAACAT CCGCATCTTC 4260

TTGAACTAAT TGCTCATATA AGCTCTGAAT CATTTCTGGA TGGATATAAT CATCTGAGTC 4320

AATAAAAATC AGATAATCCC CGTGAGCCTG CTTCATCCCA TCATTTCGTG CTTGCGACAA 4380

TCCTTCGTTC TTTTTATGAA GCACTGACAC CCTGTCATCT TGTTCAGCGA TTGAATCACA 4440 

CAAGCGACCA CTTTCATCTG TTGCACCATC ATCAACAAGA ATAATTTCCA GATTTTGATA 4500 

GGTCTGCTTC TGAATGGAAG CTATCGATTT TTCTAGGTAC TGCGCCACAT TATAGACTGG 4560 

CACAATCACA CTAATTAATG CAGTTTCCAT GCTACTCCTC TAATAGTTTT TCTACTTGTT 4620

CGATTTGTTT TGTAATTGTA AATTGTTGAA TGAATTGGCT AGCCTCATCG ACATCAAAGT 4680

TTGAGGCAGA AGTCATGTAA TTAGTAATCG CCTGAGCTGC CTCTTGATTG CTCTCAATGA 4740

TTTGTCCAAA TCGTCCTTCT TGGGATAATT CCTCAGCCCC TCCAACGTCC GTAGAGATAA 4800

AAGGGAGTCC CAGACTCAAG GCCTCCACAT ACACTCCAGG AAAACCTTCT TGTTTAGACA 4860

TAGACAAAAG AACTTTCGTC TGAGATAGAT ACTGATAAGG ATTTTTTTGA TAACCAAGGA 4920

AATGTACATA GTCCTCAATC CCATACTCTT TGACTCGTTT TTTCAGTTCC TCTTCCATAT 4980 

CACCAGCCCC GATAAAATAG AGATGATAGT TTTTTCCCTC TTGGTGTAAT AATCGTATCA 5040

CTTCCACTAC ACGGTCAGAA CCCTTATTTT CCTCAATCCG TCCGATAGTA CAGATACTTT 5100

GAGGAGCAAT CTCGATATCG ATCTTCTCTT GAGATTTTTC TAGAATAGTC TGAAAATCAT 5160

ATCCATTGTA GATTGTCTGT AATTTAGAAG TATAATCTGG ATAAACTTCC TTGATAGAAT 5220

TGCTGGTCTT TTTTGAAATC CCTACAATTG TATTCGCAGC ATCCAACTGG CTTCTATGTG 5280 

ATTCTCTTTT AGAGCTATCC TTAAGAAGTT CTTCAATACT TCCATGAATC CAAGATATCT 5340 

TCTTGACTTC TCTTCTTTTA GAGAACAACA GTGGTGGATT CATAATGGTA AAAGAAACTT 5400 

CAACATCATA ATCATCTTTT ACAAGCAAAC GACGAGTCAG TCTTGGAAAA TAAATTCTCA 5460

TTCTCCACAA AAAAGCTCGT AACCATCTGG TTTGGCGATA ATCTTGAAGG GATTTTAAAA 5520 

TGCGTACATG CTTTGGAACA GATTCATATC CCTTGTCAAA GTGCTCCATT TCAAGAATAT 5580 

CAATATCATA CTTTTCTGGA TCCAGATTTG AAACAATGGT TGATAGAATC TTCTCTGCAC 5640 

CACCTCCAAG AGAAAAAGAC CACATAAAAA ATAAGATTTT TTTCTTAGCC ACCATATTCT 5700

CCCTTGTATT CTGTATAAGA CTTATCCATA TCAGCGATGA CAGCATCATG ATGCGGTACC 5760

TGCTTGTCTG CTGGTGGAGG CGTCATATAA TCCCCAAAAG CAGTTCTGAG ATAGACATCA 5820

TAGCCGATTG GAATAGGCAT CTCTGTTCCT TCAAATGGCA AGAAAAGATT GTCTTCAAAA 5880 

GATGTGATTG GGTACTTGTT TCTCATGTAG CCAGGACCTG AGCATAATTC TGTAATGCCA 5940 

TCACAATCAG CCAAATCATA CTTAGTCATT TCTTTCTCAG CTTTTTTCCA GATGCGATAA 6000 

CGGAGAGATT TTGGAGTCAA ACCCAGTAAA ATGCGACTTC CCCATTTCAT GAGATCACCA 6060 
TGCTTTTCTG GAATAGTTTG CGCACAAAAG AGTGAATAAA TCAAGGCCCA ACGAACCTGT 6120 

TTTTTCCGCT CAGCTGGATT TTTCGGATAA TAATCCAAAG GCAAAACATC CAAGGCCAGA 6180 

CCATGTGGCA AATCCAAATC CTGCTGATAA GGCTTGATAC AGGTGGTTTT CTTGTCACGA 6240 

ATGGTAATAA AAAGATTACG ATCAACAAAA TCCTTGTGAC TCTTTGACAA GAAATAACGT 6300 

TCATCTGCAT AACGAGGCCA TAATTCTGCT AATTTCTCAT AATCTTTACG AGGCATAAAA 6360 

AAGTCTAGGT CGTCGTCCCA AGGAATAAAT CCCTTGTTTC GAAGGGCACC AATAGCGCCT 6420 

CCGCCACAGA GATAACAGAG CAAATCATGT TCTTTACAAA AGGCCACAAA ATATTCAGCC 6480 

ATCTCCAGAC TACGAGCCTG AATTGCTTTT AAATCAGTCA TATTGTTCAT TATTCTTTCT 6 540 

ATCGTATCGT TTCATTATAC CACAAACAAG GGGTGAAAAT CTATTGCAGA CTGTAAAAAA 6 600 

TCAAAGCCTG ACTGCTATCC AAATAGCTAT CAAACTTTGA TTTTTCTGTC TTATACTCTT 6 660 

CGAAAATCTC TTCAAACCAC GTCAGCTTCA CCTTGCCGTA GGTATAGGTA ACTGACTTCG 6720 

TCAGTCTTAT CTACAACCTC AAAACTGTGT TTTTAGCAGC CTGCGGCTAG CTTCCTAGTT 6780 

TGCACTTTGA TTTTCATTGA GTATTATCTT ATCTTAAGCC CATTTGAGCG AGCTTGGTTT 6840 

GATATTTGTT TTGATCAACC AGCAGGCCCA AGCCCCCATA AACATCATAG GCATCTACCC 6900 

AGTCACCCAG TTCTGGAATC GTCAATTTTT CAATACCATT TTTTGCTCCA TCCAAAACAG 6960 

ATAAACCGTT TGTTAGGAGG AAAGTATAGG GTACGTTGGT TGAGGTCATA GCAAAAACCT 7 020 

TTCCAAGAGC TTCAGAACCA GTGAAAAGTT TAGTGGGATC TTTAATTTGC TCTAAAATTG 7080 

CTGTTAAAAC TTGTTGCTGT CTTTTTGTAC GGCCGTAATC TGCCTCATCA TCATCACGGA 7140 

AACGAGCATA ATTGAGCAGG GTCGAGCCAT TCATCTGCTG TTTTCCGACT TTAATGGTTT 72 00 

GGGTTGGAGA CTCAGTCTCG GTAGCGTATA AATCATCTCC GACTGTAGCT TCTGTTAGGG 72 60 

GACGCCCATT CAATGTTGAA AATTGAGCAT CAATCGTCAC CCCATCAGGG AAAAGCGTGT 7320 

CAATCGCTGT GGCAAAGGCC TGGAAATCAA CCAAGGCGTA GTACTTAATG TCCAAGTCAA 7380 

AATTATCTTT CAAGACTTGG CGAACCATTT CTGCCCCTTT TTGCCCCTCT TGTTCTCCTA 744 0 

ACTCGTAGGC TACGTTTAAC TTGTTATCTG TCTGTTTTCT ACCATTAATC ACTTGACTAT 7500 

AACCATCTAT ATAGACCAAA TTATCACGCA TGAAACTGAC TAGCTTCATT TTCTTATCTG 7560 

AGCCCCCGAC ATTTAATACC ATAATAGAGT CAGTTCGTGT CTCAACACTG TTCTGGCCGA 7 620 

TTCGACCATC AGTACCCATG ATTAAAATAT TAACTCCATC TCTAGTGTCC TGACCATTAA 7 680 

AGACTTCTAC TTGAGCTGCC CGGGCATCAG CAGTTTTCTT TGCGCTAGCA TCTTGGTAAC 7740 

CACGCAAAAA CATGAATACC ATGGCCAAAG CCACACAGAC CAAAAGTGAA AAAATCACCA 7800 

TAAAAATTCG TTTAAGACGG AGCTTCCGTC TTTTCTTTTT TGGAGGGAAA GAGAGTGCTT 7860 
GTGATTTGGA TTGTGAGCGA CTCCGGTTCG CATAGCTTGG TAAGTCAACC TGCTCTTCTC 7920 

TTTCTTGTTC CAAGCTAGAG CTACTATTTC CCCTAGCAAG AGTTAGCTTT TCTTGCAAAT 7 980 

AGGCAAACTC ATTTTTTTCT CTCTCATTGA GATAGTGAAT ATTTTTTAGC AAATAATCAT 8040 

AACGCAACTG CTCATGATGA CTTAAGGGAT TTTCTTTACT CATCTTCTCT CCTTTCCATG 8100 

GTCTGATATT GGATAAATAG GATAGGCACC CAGAATTTTA TACTGGATTC CAATCGCTTC 8160 

TAATTCTTTT TGGGCAAAGT GGACCAAGTC CTTATCGGTA TAATCCACAT CGATAATGAA 822 0 

AAAGTATTCA CCCAGTGCTG TCTTGAGTGG ACAACTTTCA ATTTTTGTCA AGTCAATTCC 82 80 

TCGCCAAGCA AAGGTCGACA GGGCCTTATA AAGTGCACCT GGAAGGTTGT CAGGTAATGT 834 0 

CAAGGCCAAA CTCATCTTTT CAGTTTGTGC TTGCAAGGGA ATACTAGGCT TTTCAGCTCC 8400 

TAGAACCCAG AAACGTGTGA AATTGGCTTC CATTTCCTGA ATATCCTCGG CAATCAGTTC 84 60 

CAATCCATAT TCTTCAGCAG AACTTCTAGG TGCAACTGCT GCAAAGGGCT GGTCTGGATG 8520 

TTCGGAAATA AAACGGGCCG CATAAGCTGT ACTAGCTGTT ACCTCGATTT GAGCCTCTGG 8580 

ATATTGTTCA TCGATGAATT TCTTTCCTTG AGCCAAGGCC TGTGGATGTG AAAAAATCTT 8640 

TTCAATCTTA GTATGGCCTG GAACCACCAT CAACTGCTGA TGAATAGGCT GAACGATTTC 87 00 

TGCTACTGCT TGGATGTGAG CCTGATGAAA AAGATAGTCC AAGGTTTCAT GAACACTACC 87 60 

CTCAATAGAA TTTTCAACTG GCACCACAGA ATAGTTCACT AATCCTTGCT CATAAGCCTT 8820 

GATGACATCT GTAATGTTGG CAAAAGCCTG CAATTCCTCA TGAGGAAAAG CTGTCTGCAC 888 0 

AACGTGGTGT GAAAATGATC CCTTGGGACC TAGATAAGCA ATTTTCATCT TAGTTCCTCT 8940 

ATAATTTCCT CTGGGCTTAG CTTGGTCACA TCCAAAACCC GACTAGCCAC TTCCTCATAC 9000 

CAAGCCTGTC TTTCTTGGAA AATAGCTACT AGTTCTTCCT TGCTATTATT TAGAAAAAGC 906 0 

GGTCGCTGAT TGTCCTTATC AGCTGCGATA CGTTGGTAGA GGGTTTCAAA ATCTGCTCTC 912 0 

AGGTAGATGT TATCTGTATT AGTCTTGAGT AAGTCACGAT TTCTCTGAGA AATAACCACT 9180 

CCTCCTCCAG TTGACACGAC TTGGTCTGTT TGTAGTAAAT CAGCTAGGAC TTCTGATTCT 924 0 

ACCTGACGAA AGGCTGTTTC TCCCTTTTCA GCGAAAAAAT TCGCAATGGA CATACCTAGG 9300 

CGATTCTCAA TCAGAGCATC CATATCAAGG TAATTAGGGT CCAAGCCTCT TGCAATAGTC 93 60 

GATTTTCCAG CCCCCATAAA CCCTAATAAC ACCTTAGCCA TGAATCAAGC TCTCCAAATC 942 0 

ATCAAAGAAA CTAGGATAGC TGGTATTGAT GGCTTCTGCA CGGTCAAGCT CCACCTCTCC 9480 

ATCTGCAACC AAGAGGGCTG CGATAGCTGT CATCATGCCG ATACGGTGGT CACCAAACGT 9540 

ATTGACTCTA GCACCGTGAA GAGCTGATTT TCCTTTGATA ATCATCCCAT CTGCCGTAGG 9600 
AGTAATATCT GCTCCCATAC TATTTAAGGC GTCTGCCACA ACCTGAATAC GGTCTGTTTC 9660 

CTTGACCTTG AGCTCCTCAG CATCCTTGAT AACTGTTACA CCTTGGGCTT GGGTCGCAAG 9720 

CAGGGCAATA ATGGGCAATT CATCAATCAA TCGTGGAATC AAAGCGCCAC CAATCTCTGT 9780 

TCCTTTCAAG TCAGAAGACT CAACAATCAA GGTAGCAGAT TTAGCGACTG GATCGATTTC 9 840 

AGTTATTTCC AATTTTCCAC CCATGGCACG AATGACATCA ATAATACCGG TGCGAGTTTC 9900 

GTTGATCCCC ACATTCTGCA GCACTAGACG AGAATTTGGA GCAATCAAAC CTGCGACTAA 9 960 

CCAAAAGGCT GCACTGGAAA TATCTCCTGG TACGACCACC TTCTGTCCTG TCAATTTTTG 10020 

TGGCCCCTGG ACTGTGATTT TCTTACCATC CACACTTAAA TGACCACCAA ATTGTTTCAA 10080 

CATATCTTCA GTATGATTAC GGGTGTACTC TTTTTCGATA ATAACTGACT CCCCCTTAGC 10140 

TTGTAAGGCT GCAAACATCA AGGCTGACTT GACTTGGGCA GAGGCAATTG GCAACTCATA 102 00 

ATGAATAGGT CTTAGGTTTT TCGTCCCTTT TAAGCGAAGG GGAGGCAAGT CTCGTTCAGT 102 60 

TTGCCCTGAA ATGCTGACGC CCATTTTTTT CAGTGGAAGG GTCACACGGT CCATAGGACG 10320 

TTTGGAAAGA CTATCATCTC CAAACATCTC TACTTCGAAA TCTGCACCAG CAAGGACACC 10380 

TGAAATCAGG CGAATCGAGG TGCCAGAATT TCCCATATTA AGGGCATTTT GTGGCGCTTT 10440 

TAAGCCAGCC ATGCCTACAC CTTGAATGGT AATAACCCCA TCTTTATCCT CAATTTCAAC 10500 

ACCAAGGTCA CGAAAAACCT GCATGGTCGA AAGAACGTCT TCACCTCGCA GAATATCATA 10560 

AACCTTGGTC TCACCCTCAG CCAAACTTCC AAAGATAATG GAACGGTGGC TGATAGACTT 10620 

GTCACCTGGG ACGCGGATAC TACCATGTAA ATGGCGAATG TTTGTTTTTA GTTTCATACT 10 680 

GGACCTCATA CTTGCAATAC TTTTACCTAT TTTATCATAA AAAGCCAGAA ATTCCTTAAA 107 40 

AATTCCTGAC TTTAGGATCG TTCTTTTCTT ATTTCAGCAA TTCTGAAACT GGTTCAAAAA 10800 

CAATTTTTTC AATATCAGAA AGGTAAATGG CCAATTGTTG TTGCTTGGTA AAGAATTCTG 10860 

ACAAGAGGCT ATTTCCTTGA ATCTGTTTAC CAAAGCCTTC CATCTTAGCT TGGAAGGACG 10920 

CATCTGGCAT TTGACCTGTC TGTGCTAGTT TTTGAATTTC CTCTTGAAAG GCAAGATAAT 10980 

CTGTAAAGAT TTTGCTTGCC TCAGCATCTG CTGCAATCGC ATCTTTAGCT GCTTTAACAC 11040 

CCTTGTATTC TGGTAATCCG CGTAGACCGC GACTGAGTTC GTTTGCACTA TCGTAAATAT 11100 

TTGACATGTT CTTCTCCTTA TTTGATGACG ACTGTATAGT CAGTATTTTC TGTTATGAGA 11160 

TGCTCAGCTC TTTCCAAGTC TTGAGCATTT TTAAATGAAA TTTGTAGGAT TCCGTGAATA 11220 

TCCTCACGAT TTTCCTCGTT GATGTGGATA TTAACCAAGG AAGTTCCACG TAGCAGTTCC 112 80 

AAAATCCGCA GGATGACATC TTCTTCATCA GGAACGTCAA CATAGAGGTC GTAAGAGCTA 11340 

TCCACACCAC CACGCTTATG GATTTCCATG GTCTGGCGTT GTTCACGCGC TTGGTTAAAA 11400 
AAGTTCCAAA TTTGCTCTTC ATCTCCCTTA CTAATGGCCT GACCAATCGC TTCCAAACGT 11460 

TCCTTGAAAT CCTCAATTCT ATCCAGAATG ATCTCGCTAT TGGACAAGAG AATGGAGGTC 11520 

CACATTCCTG GCTCGCTTTC CGCAATTCGG GTCATATCTC GAAAACCACC TGCCGCAAAG 11580 

CGCCTTGCCA TCTCATGCTC TTGAGCATAG ACCGCAGTCT GCTCCATGAG ACTAGAAGCC 11640 

AAAATATGAG GAAAATGGCT AATCTGAGAA GTGACACGAT CATGCTCCTT GGCATCAATC 11700 

TCGATAAAAC GAGCATGAAG ACCTGAAAGC AGATCCTTCA TTTCCTTAAG CGTGTCCTGA 117 60 

CTTGTCAGGC TTGAAGGTGT AAAGATATAA TAGGCATTTT CAAAAAGATT GACATCTGCC 11820 

GAAGCAGCCC CTGTCTTGTG ACTACCAGCC ATGGGATGGG CCCCGACAAA GCGAACAGAC 11880 

TTGCCAGCCA AATACTGCTC CGCCGCATCC ACAATGGTTG ACTTGGTCGA ACCAGCATCT 11940 

GAAATAATAA CGCCTTCTCG CAAATCCAAA TTGGCCAACT CCTTAATGAA AGCAATAGTT 12 000 

TGTTTGATTG GCAAGCTGAG GATAATGACA TCTGCCAAAG GAGCAAAACT AGCAAAATCA 12 060 

TCCGTTGCAC GGTCAATCAT ACCTTCTTTC AAGGCGATAT CTCTCGAAGC TTGACTACGA 12120 

TTATAACCTA AAATTTCATA ATCTGGATGA TCGCGTTTGA TACCAAGTGC CATAGAGGCT 12180 

CCAATCAACC CAAGACCTGC GATATAGATT GTTTTTGCCA TAGGAACTCC TTAATAGTTC 12240 

TTTGTATAGT CTCGGTGTTT GGCTACCGCT TCTTTTAGTT CCTCAAGATT ATCTGATGAG 12 300 

AATTTTTCGA GGATTTCTTG CGCCAGAACC GTTGCTACAA CTGCTTCCAT GACCATTCCT 123 60 

GCAGCTGGAA GAGCAGTCGG ATCACTTCTC TCCACGGTTG CCTTGTAAGG TTCGTGGGTT 12420 

TCGATATCCA CACTCATAAG AGGTTTATAA AGAGTAGGAA TGGGTTTCAT GACCCCACGA 12480 

ACAACGATGG GTTGCCCATT AGTCATACCA CCTTCAAAAC CACCTAGATT ATTGGTACGG 12 540 

CGAGTATAAC CGTCTTCTTT AGACCAGAGA ATTTCATCCA TAACTTGGCT GCCTTTACGA 12 600 

TAACCAGCCT CAAAGCCAAG ACCAAATTCC ACCCCTTTAA AGGCATTGAT AGAGACAACA 12660 

GCTTGAGCCA ATCTTGCATC CAATTTTCTA TCCCATTGGA CATAGGAACC AAGACCAACT 12720 

GGAACGCCTC CGACGACTGT CTCCACAACC CCACCGATGG TATCACCATC ACGTTTGATT 127 80 

TGGTCAATAT AGTCCTTGAT TTCCTGTTCT CGTTCTTGGT TGACAATAGA AACTTCAGAC 12 840 

TGGGCAGCTC TTTGCTTAAT TTCAGCGACT GTCAGATTTT CAGGAACATC GATTTCCTTG 12900 

CCACCAAAGA CCACGACATG GTTGGCAATC TCCATATCCA GCTCAGCCAA GAGGCGTTTG 12 960 

GCTACTGCAC CAACTGCCAC CCGCATGGTG GTTTCACGAG CTGATGAACG CTCCAAAGAA 1302 0 

TTTCGCAAAT CATCAAAACG GTACTTAATC CCCCCAACCA AATCGGCATG ACCTGGGCGA 13080 

GGATGAGTAA TTTTCCGCTT GCTTTTAAGG CGGTCTTCAA TGTCCTCCGC AGACATGATG 1314 0 
TCCAGCCATT TCTGGTGGTC CTTATTGATG ACATCCATAG TAATAGGCGC CCCTGTCGTC 13200 

TTCCCGTGGC GAACGCCCGA AGTAAAGACA ACCTGGTCAT TCTCAATCTT CATACGACCA 13260 

CCACGACCGT AGCCACCCTG ACGGCGTCTA AGGTCCTCAT TGATATCCTC AGCTGTCAAT 13320 

GGAAGTCCAG CTGGAATTCC CTCAATAATA GCTGTTAGAC GGGGGCCGTG TGATTCTCCT 13 380 

GCAGTTAAAT ATCTCATACA CTCTCCTTAT TTTACCAAGT AGTCTTTCAT CTCTTCCAGA 13 440 

GAAACTGGGT GAATGGTCGC TGAACCAAGC TCTGGCACCA AGACCAATTT CAAGGTGTTA 13500 

CCACGCGCTT TCTTGTCATG AGTAAGAGCC TGATAAAGCT TGCCAACTTC CCAATTTTCA 13 560 

TAGTCAACAG GCAAACCGAA TTTCTGACAC ATCTCTGTGA TAGATTGGGT AATGCCAGCT 13 620 

GGCATGAGGC CTTTTTCCTC AGCAACCTTG GAAATCTGTA CCATTCCCAT GGCAACAGCC 13 680 

TCTCCATGCA TGACCTTGCC ATAACCGGCA GTCGCTTCGA TGGCATGGCC AATAGTGTGG 13 740 

CCAAAATTGA GGTAAAGACG AATACCATTG TCCAACTCAT CTTCAACCAC CATCTTGCGC 13800 

TTCACCTGAC AAGAATGTTC AATCAAGGTC TCTGCATGTT CCAAAATACT CTCAACAGAA 13 860 

CCATTCAGTC CCGTCAAGAG AGCCCACAGT TCTGGATCCT CAATCAAGCC ATACTTGATA 13920 

ACTTCACCCA TCCCTTCAAT CAACTCTCTT TTTCCGAGGG TTTCAAGAAC AAGTGGATCA 13 980 

ATCAGAACCC CATCTGGTTG GGCAAAGGTC CCCACCATAT TTTTAGCAAA TGGTGTATTA 1404 0 

ACGCCTGTCT TTCCACCGAT AGAAGAATCA ACCTGAGCTG TCAAACTAGT CGGAATCTGA 14100 

ACAAAGTGAA TACCCCGCAT ATAGGTAGAG GCTACAAATC CAGCCAGGTC CCCAACAACG 14160 

CCACCACCAA GAGCAACGAT TCCATCGCTA CGAGTCAGAC CTTGCTTGAC TAGAAATTCA 14220 

TAGACTTTCT GAACAGTAGT TAAATTCTTT CTTTCTTCAC CTTCTAAGAA ATCAAAAACA 14280 

GCTACCTGAA AACCAGCATC TTCTAGGCTG AGCTTGACCT TCTCTGCATA GAGAGAGGCT 14340 

ACATGGTTAT CTGTCACAAT GACTACCTTT TGCGGTTGCC AGAGTTCTCG CAACCACTGA 144 00 

CCAGCCTGGG CCATACAACC TTTTTCAATC TGAATATCAT AAGGATGGTG AGGAATATCG 144 60 

ATTCTGATTT TCATAGGAGA GTCTCCCTTT CTTTATTGGT ATTTTTCTGT TAAAGACTGC 1452 0 

CAAATCTCTT CTGTCGGCAT TTCCTTGCCT GTCCACAGTT GAAAAGCTTC TGCAGCTTGA 14580 

TAGAGTAACA TTCCCAGACC ATTGACTGCT GGATTGCCCT GACTTCTAGC CCATTTCAAA 14 64 0 

AACGGTGTTT CAAAGGGTTG GTATATGATA TCTGCAACTA AAAGAGTTTC TGGTAAGACT 147 00 

ATGTTTTCAG GAACAGGAGA GGATTGGCCA TCCATGCCCA CACTGGTGGC ATTAACTAGC 14760 

AAATCCGACT CGGCAATCCT TGCTTGCAGT TCAGAAACAT ATTCTAAAGC ACACAAATCC 1482 0 

ACTTTAAAAC CTGTCTGCTC CTGTAACTTG TCTAGGTAAG GTCTTGTTTT TTCCATAGAA 14880 

ACGGAACGAA CAAAGACCGA AATCTGACTG ACGCCATCCA AAATAGCCTG TGCCAAGATT 1494 0 
GATTTAGCCG CACCACCTGC ACCCAGCAGG GTCATCTTTT TACCTGAAAT TGTAAAAGAA 15000 

GGCAAGCACT TAAAAAATCC CTTGCCATCT GTATTATATC CAATTAAATT GCCATTCTCA 15060 

TTGACAACCG TATTAACCGC ACCAATCAAG CGCGCTTCAT CGCTCAGCTT ATCCAAATAA 15120 

GGAATCACCT GCTCCTTATA GGGCATGGAC AGATTGATGC CAAACATCTG GTAGCGACGA 15180 

ATATTGGCCA CTGTTTCTAC CAAGTCACTC GCTTCAATCT CCCAAGCCAC ATAAGCACCG 15240 

TTGGTAGCTG TCGCCTCAAA GGCTCTATTG TGGATGAAGG GAGAAATAGA ATGCTTAATA 15300 

GGATTGGCAA CAACTGCAGC TAAACGTGTA TAGCCATCAA GCTTCATCCA AAATCTCCCT 153 60 

GATTTTTTTC ATGCTAGCTA GAGAAATCTG CCCAGGGGCA CTAACCTCAT CCAGACTGGC 1542 0 

AAAAGACCAA CTCGAACCAG TCACATCCGC AGTGATACGA GAGACCTTGC CCACCTTACC 15480 

CATAGAAATG GTCACATATT CCTGTTCAGG ATTGAGGGTT TTAAAGCCTC GTGTATAGTT 1554 0 

CATCAAGTCT AAGACATCCT GCTCCGTGTG AGCCATCACC GCAACCTTAA CAAGTTTTGG 15600 

ATTTAGGATC GTCAACTCTG ACAAGATTTC CATCATGTTC TCAGGTGTTT CTTGGAAATT 15660 

ATGGTAACTC AAAACAAGAT TTGGGAAGTC CAGCATTTCC TCAAAAACAT CCTTGTAGCT 1572 0 

ATAGTACTCA AAATCAATAT AGTCTGGTTG ATAGAGTTGC GCAACTTCCT TGATTAGATG 15780 

GATATACTCT TCTGGAGAAA GGTCGATTTC TCCACCTTCG GAGCGAGTTC GTAGCGTGAA 1584 0 

AACCAACTCA CGGCCTGCGA ATTTTTCAAA AATGGCTGGA GCTACCTGCA AAATCGCTTC 15900 

TTTAGGCAGA TAGTCGGCAC GCCATTCAAT GATGTCGGCA TCCAGGTACC TCGTGGCATC 159 60 

CAGAGCCTGA GCCTCCTCTA AACTTCTTGG CATTACTGAA ACGATTAATT TCATTTACTA 1602 0 

ACCTTCATAC TAATCACCTT GAGGTAATTA CTACTTTCAT CTTTTTTATT ATAGGCAAAA 16080 

TCTGCTGGAA GACCATATTT GTTTAAAATC TGGTAACTTC TTCCTGCAAA ACCTTTATCA 1614 0 

ATTTGTTCTG TAAATTTCTG ACGGGAAACA TTGGCAGCAT TGGTACTGGC AATGATAATC 16200 

CCTCCCGGAT TTAAAATCTC AAGACTCTGG GAAATCAACT TGTGATAATC CTTGGCCACA 16260 

GAGAAAGTTT GTTTTTTATT CCGAGCAAAG CTAGGCGGAT CTAGGACAAT CACATCGTAG 16320 

GTCAAGTCTT TGCGTTTGGC ATATTTGAAA TACTCAAAGA CATCCATGAC TATAAAACGA 16380 

TGCTCGTCTG TGCTGAGCCC ATTTGCCTGA AAATGCGCTT GAGACAATTC TCGTGAACGT 16440 

TTGGCTAGAT CAACAGAAGT TGTATGGCTA GCTCCTCCCA TGGCCGCAGC TACTGAAAAA 16500 

GCCGCTGTGT AGGAAAACAT ATTGAGTAAG GATTTACCCA TAGCCAAGCC GTCAACTAAA 16560 

CTACCGCGAA CCTCATGCTG GTCTAGGAAA ATTCCTGTCA TCAAGCCATC ATTCATAAAG 16620 

ACTTGATACA GGACACCATT TTCTAAAACA TTGAAAAAGT CAGGTGCTTC TTGACCATAA 1668 0 
ACATGGGCAG ATTCATAGTC CAAACCCTTA AAGCGGATTT TCTCATAAGC TCCTAAAACC 16740 

TCAGGGAAAA CCTGTCTAAA GGCTTCTGAT ATAGTCTGAC GAATCTGATA AACATAAGAG 16800 

TTATACCAAG AAAAGACGGC GTAGTCGCCA TAAAGGTCCA CTGTCAGACC CCCAAAGCCA 16860 

TCTCCCTCTT GATTAAAGAG ACGAAAGGCA GTTGTCAAAT CATCTTGATA GTAGGCGTTT 1692 0 

CTCTTTTCTT TGGCTTTTCT AAACAACGTT TCAAAGAAAG CTTGATTGAA GGCCACCTTG 16980 

TCTTTGCTGA TAAACCAGCC CAAGCCCTTG TTTTGCTGAG AAAGGTAGGC AGTCCCAAGA 17040 

AAGTTTCCTT CCTGACCCTG CACCTCTACT TCCTGATCCT TAAGATTGAC ATTCTCAAGA 17100 

TCACTGGCTT CTAGTAAAAC TAGCCCCTTA GCAAGCTTCT TTTCAACCCT TTTGCTGACT 17160 

CTTATTCTAT TCATAACTAC CATTATATCA AACTTTTAGA CAATTCTCAA AAAAGAAACT 1722 0 

ACCCTTGCTT TTTTACTCTT CTTTTAAAAA ATGGTATACT AGACTTCCTG CAAAACTAGG 17280 

AAGTAAATGT GTAAGAATCA CAGTAAAAAA TGCTCTTCCG TCTTGGAGGA GCATTTCTTT 17340 

TTATCAACGA AAATCAAATA GCAAACTATG AAACTAGCCT CAGGTTAACT GTGAGATTAT 17400 

AGGTAGAGAG GTTGTATCAG CAATATGTGT CTGTCAAATT TAGTGACAAA GGTAGTAGAA 174 60 

GAAAGATAAA GAAATAAATC AGCTTCAGTA GGTATCTGGA AAATTTGATT TTATAGAGAA 1752 0 

GCCTTTTGTT ACAAACTCAA TATACTATCA ATAAATAATA TTATAGAAGC AACAATAATT 17580 

ATAATTTCAC CTATCTGCAT CATTCTATTT CGAACTCTAA ATATATGTTC TATCAAAAAT 17640 

ACTTGGAACA CACACATTAT AGGAATTAAC GTTTTTGAAA TTGAAAAATA TCCAAATAAA 17700 

TAAACTATAA ACAACAAAAA TAGAACTATG TTATATTTCT TATTCAAAAC ATTCCTCCCT 17760 

ATATATTTTT GATTACCAAT CTTAATCATT TACAACTACA TTCTAACAAA CTATAAAAGC 17 820 

GTTTGTCGAA TTGAATTTAT CAAGCAAGCG ACCAACCAGT TCATCTTTTT TCTATTTCTG 17 880 

CCAATATGCG TGACAGGTAA TAATGATAGC CAAAAATAGC AAGAGCAAGC AAGACGATAA 17 940 

GAGCTCCTAC TCCCAAGCTG ATGGCAAGGA TAGGGGAGAG AGACTGAACC AAGAATATGC 18000 

TCCCAATTAC AAGGGCCATC AGGATTGCAC TATAAATAAA CAATAAAACT ATGGCGAC XA 18060 

TGCCATTTGA ACGATTCACC AGGTCCGTAA TGCTACTCCA ATTGGTTGAC AGATTTTTAA 18120 

CGTCCTTAAA GTAATGGTGG CAAGAAAGGA TGACACTGGC AATGATCCAG ACTACAAGAA 18180 

GGTAAATCAT CGAAATGATG GGCAAGCCTA GATATAGAGA AAGACCAAGC AAAGTCAGAA 18240 

CTGGTAAAAA GGACTGGACA GCATATATAA TCCAAAATTT CACTTTCACA TAACGAGCAA 18300 

AGTCAAAGGG TAAACTCTTA AGAAAATCAA CATTTTCCCT CTCCAAGGAC AAGGCAATTG 18360 

AATGCAGGCT GGTGATATTG TTATTGACAA CTGCTATAAA GAGAGCTATA AAAAACAAGG 18420 

GTAACCAGTA TGGAGGATGA ATGTCTGGAA CTATCTGAGA ATCTCGGATT TTGGAAATCA 18480 
GACCGATCAT CATGAGATAA GGAAGGAAAG CACTTGTAAA AAGCACTGTA ATCACGCCAG 18540 

TCCCCTGTCC CAAGAGGGTG AGGTGGTAGC GTAAAACCAT GCGAAAAAAT CCCTTTTTAG 18600 

TGGTTGAAAT TCTCTCCTTG CTGCGACGTT CTTTTTTGAC CTTCTCCTCA CTATTAAGCA 18660 

GGATCACGTC ATAAAAACGA GGAAGGACCT TCTTTTTGGT CAGATAAAGC AGGAAGAGAG 18720 

TTAGTCCTAT CCAAGCGAGC AGACCCACTA AGGCTTCTGT CGAAAAAGGC TCCACTGCTA 18780 

TTTTGTAAAA GATATGAAGA GGATAAAGGA GAAATGGAAT GTCTCTAACT TTGTCAACAA 18 840 

TACTTCCAAA AGTCGACTGA AGAAAGAAGA TAAATATTAA AGGTATGAGA ACTCCTATCC 18900 

CAATCATCAC ATTCGAAAAA ATAGACTGAT ACTTTCTGAA GACCCTAGTT TGAGCCAAGA 18960 

AATGCACTGC CACTACCATC ACTAGAGCCA CAGAGACAAA TAATAAGGTC AAGGACAGTA 19020 

GCATCAAAGG CAAACCCAGC CATAGAGAAG GAGCTAGCCT AATGTAGAGG ACCAGAAAAT 19080 

AAGCTAGGAT TGGTACAATT CCAGTTAGAG CTGGCAAAAG GACAGACAGT CCTTTAGCAA 19140 

TTATAATCTC TGATTCTTTA AAGGCATAGG GCCTATACGA TACCAAATCC TTACTCTCAT 192 00 

AAAAGACATT GTAAAAGGCC GTTAAAGAAG TTGAAAAGGC AATCACTAGT AAAATAGCAA 192 60 

TCATCGAGCT AAAATAAATA GGTATTTCCT CAAAAGGAAA ATGAATGGCT ATATTACTAA 193 20 

AACAGATGAT CATCAAGAGA CTGGAAAAAA TGTAAGAACT TAAGACTCTA GCGGAAACAT 193 80 

TTACTTTTTT 19390 
(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 

CCGAGCGTCG TTACAGACTT TATCAAGATT GGACGCAAGA AGAAATTCAA CATATAAAGG 60 

AAAATATGGC ACAATCTCCA TGGCATACTC ATTACCATGT TGAGCCAAAA ACAGGACTTC 120 

TCAACGACCC AAATGGCTTT TCTTACTTTG ATGGCAAGTG GATCCTCTTT TACCAGAATT 180 

TTCCTTTTGG TGCAGCCCAC GGTTTAAAAT CTTGGGCACA GCTAGAAAGT GATGATTTGA 24 0 

TTCACTTTAA AGAAACTGGA ATCAAAGTTT TACCAGATAC TCCATTAGAT AGCCACGGTG 300 

CCTACTCTGG TTCTGCCATG CAATTTGGCG ATAACTTATT CCTATTTTAT ACAGGAAATG 3 60 

TTCGCGATAA AAACTGGATC CGTCACCCAT ACCAGATCGG TGCTTTGATG GACAAGGAGG 420 
GTAAGATTAC AAAGATTGAC AAGATCTTGA TTGACCAGCC AGCAGACTCT ACTGACCACT 480 
TCCGCGATCC ACAAATTTTT AACTTTCAGG GTCAATATTA TGCCATTGTC GGCGGACAAG 540 
ACTTGGAGAA AAAAGGTTTC GTTCGTCTCT ACAAGGCTGT CAATAACGAC TACACAAACT 600 
GGCAAGCAGT TGGCGACCTT GACTTTGCTA ACGACCGTAC TGCCTACATG ATGGAATGTC 660 

CTAATTTGGT CTTTGTAGAG GAACAACCTG TCCTTCTCTA CTGTCCACAA GGATTGGATA 720 

AGAAAGTTCT AGACTACGAT AATATCTTTC CAAATATGTA TAAGATCGGG GCTTCCTTTG 780 

ACCCTAAAAA TGCCAAAATG GTAGATGTGT CTCAACTTCA AAACATGGAT TACGGTTTCG 840 

AAGCCTATGC AACTCAAGCC TTCAACGCTC CTGATGGGCG TGCTCTAGCA GTTAGCTGGC 900 

TTGGTTTGCC AGATGTTTCT TACCCATCTG ACCGTTTTGA CCACCAAGGA ACCTTCTCTT 960 

TGGTCAAGGA ACTCACTATC AAAGACGACA AGCTCTACCA GTATCCAGTC GCTGCTATTA 1020 

AGGACCTTCG TGCTTCTGAA GAAGCCTTCT CAAACCGTTC CCAAACCAAG AACACTTACG 1080 

AACTTGAACT CAACTTGGAA GCTAATAGCC AGAGCGAGAT TGTCTTACTT GCTGATAAAG 1140 

AAGGTAAGGG ACTTTCAATC AACTTTGACC TTGTAAACGG TCAAGTAACA GTGGATCGTA 12 00 

GCCAGGCTGG AGAACAGTAT GCCCAAGAAT TTGGGACAAC TCGTTCTTGC CCTATCGAGA 12 60 

ATCAGGCTAC TACTGCTACA ATCTTCATCG ATAACTCTGT CTTTGAAATT TTCATCAATA 1320 

AAGGAGAAAA AGTATTTTCT GGTCGTGTCT TCCCACATGC GGACCAAAAT GGTATCCTGA 13 80 

TTAAATCTGG AAACCCAACT GGAACTTACT ATGAATTAGA TTATGGTCGC AAAACTAACT 1440 

GATGTCGCCA AACTTGCAGG CGTCAGTCCT ACTACCGTTT CTCGGGTTAT CAATAAAAAA 1500 

GGGTATCTAT CTGAGAAAAC CATCCAAAAA GTCAATGAAG CCATGCGAGA ATTGGGCTAT 1560 

AAACCCAACA ACCTGGCTCG TAGTCTGCAA GGAAAATCAG CTAAGTTAAT CGGCTTGATT 1620 

TTCCCCAATA TTTCCAATGT TTTCTATGCA GAATTGATTG ATAAATTGGA ACACCAACTC 1680 

TTCAAAAATG GTTACAAGAC CATCATCTGC AACAGTGAAC ATGATTCTGA GAAGGAACGC 1740 

GAATACATCG AAATGTTGGA AGCCAATCAG GTGGACGGCA TCATTTCTGG TAGTCACAAC 1800 

CTAGGAATCG AAGACTACAA TCGTGTGACA GCGCCGATTA TTTCCTTTGA CCGAAACCTA 18 60 

TCGCCAGACA TCCCTGTCGT CTCCTCTGAC AACTATGCTG GTGGGGTTCT TGCTGCCCAA 192 0 

ACCTTGGTCA AGACAGGTGC CCAGTCTATC ATCATGATTA CAGGGAATGA CAATTCTAAT 19 80 

TCGCCAACCG GACTGCGCCA CGCTGGTTTT GCATCCGTAC TCCCAAAAGC TCCTATTATC 2 040 

AATGTTTCCA GTGACTTTTC TCCCGTCAGA AAAGAAATGG AAATCAAGAA TATCTTGACC 2100 

CGGGAAAAAC CAGATGCCAT TTTTGCTTCG GATGATTTGA CAGCTATTCT GGTCATTAAA 2160 

ATCGCTCAAG AATTGGGCAT TTCTGTCCCA AAAGAGCTCA AGGTCATCGG CTATGATGGG 222 0 
ACCTACTTTA TCGAAAATTA CTACCCTCAA TTGGCTACTA TCAAGCAACC TTTGGAAGAG 2280 

ATTGCTTGTC TCACTATTGA TCTTCTCTTG CAAAAGATTG AAGGCAAGGA AGTCGCCACA 2340 

ACTGGTTACT TCTTACCAGT TACGCTATTA CCAGGAAAAA GTATTTAAAC ACAAGAAAAC 2400 

TCAGACCGAT TCGTCTGAGT TTTTATGATC TTAAATTTTC GAGATAGCGC TGGGCTGTCT 2460 

CTAGGTTAAA GGTTTTATCT GAGATGAGGC GCTCTACTAG GGGAGCAACT TCAGATTCAC 2 520 

TAGCCCCAGC TAGGAGAGCT AGGGATTTGG CCTGTAGTTT CATGTGGCCT TGCTGGATGC 2 580 

CCGTACTTAC CAAGGCTTTG AGGGCTGCAA AATTTTGAGC AAGACCGATG GACACGATAA 2 640 

TCTGGGCTAA TTCTCTGGCA GAAGGATTTC CTAGTAGATC ATGACTGAGA ACTACACGTG 2700 

GGTTGAGGCC GATAGAGCCA CCCTTAGTCG CTACAGGCAT GGGCAGGGTC ATCTCACCGA 27 60 

CCAATTCTTC TCTTTCAAGG TCCAGCGTCC AGCAGCTAAG ACCTTGATAG CGTCCATCTC 2 820 

GACTGGCAAA GGCATGGGCC CCAGCTTCGA TGGCACGCCA GTCATTACCA GTGGCAATCA 2 880 

AAATCGCATC AATACCATTA AAAATTCCTT TATTATGAGT AGCAGCTCGG TAAGGATCAG 2 94 0 

CCTGCGCAAA CTGACTAGCC AACGCAATTT TCTCCGCAAT CTCTCGTCCT TGATCCTTTT 3000 

GGCGGCTCAA GTAGCGAAAG GCGATGCGAC AGCTTGCAGT CACCAGAGAA TCGGTCGCGT 3 060 

AGTTGGACAG GATTCCCATG AGACTCTGTC CCTGACTGAG TTCTTCTAAG ACTGGTTTCA 3120 

AGGCTTCCAG CATGGTGTTG AGCATATTGG CACCCATGGC TTCCTGGGTA TCGACATGAA 3180 

TATAAACAAC GAGAAAGTCT GGTTCGCCTT TTATCTGCTC GACATGCAGA TCACGCGCCC 3240 

CACCTCCACG TTTAACGATA GAAGGATAGG CTTGATTGGC AAGCTCCAAG AGCTCCGCTT 33 00 

TCTTGCTGGC AATCTTCTCT TGCGCTAGTT TAGGATTAGC AACTTGATAA AGGGCTACCT 33 60 

GCCCAATCAT CTGTCGCTGA TGGACTTGTG CAGTAAAACC ACCTGCACGC TTGATGATTT 3420 

TGCTGGCATA GCTGGCCGCC GCAACCACAG AGGGTTCTTC TGTCACATAG GGAACGGTGT 3480 

ATTCCTGACC GTTGACAAGT ACCTCCGGAA CCAGTGAATA AGGCAGAGAA AAAGTTCCCA 3 540 

CTACATTCTC ACTCAGCTGG TCTGCCACAG TCACGCTCAT CTGTTCATCC TTCTCCAGAC 3 600 

TAGCTTGTCT CTCAGGACTA AGGAGCGCCT GAGCTTTTAA CAGCTCGAGG CGCTCTTGGT 3660 

ATGATTTTTT AGAAAATCCA TTCCAACTTA TCTTCATTAT TTTTCAACCT TGCTATAACG 3720 

GCGTTGGTGG TCGAGAATTT CAACCAAGGC AAAATCTTGA TTTTCATAGC CAGCAAACTG 3780 

GGCAGAGTTA GTTTCATCCA AGTTTACTTC CTCAAAAAAG ACCTTTTCAT AGTCTGCAAC 3840 

GGATAGGGCA GTTCGTTGGT TGAGCTTGTT CAAACGGTCT TTATCCAAAT AAGCTTCATA 3900 

TCCTTCAACC AATTCACCAC TGAAGAACTC AGCCACAGCT CCACTTCCGT AACTATAAAG 3960 
GGCGATTTTA TCCCCAGCTT TCAAGCTATC TGTATTTTCC AAGAGAGACA AAAGTCCAAG 4020 

GAAAAGTGAA CCTGTGTAGA TATTCCCCAC CTTTTGACTG TAGAGAATAG ACTGGTCAAA 4080 

ATGCTTTTGT AAGAGGTCTT TTTTCTCTTG AGGCAGGCTC TTATCCATGA TTTTTTTCAA 4140 

GCCTTTTAGC GCTAATTTAG GATAAGGCAA GTGGAAACAA ACAGCCGCAA AATCATCCAA 42 00 

AGTAAGCTGG TAGCGTTTTT GATATTCAAG CCAAGTCGTT TTCAAACTAT CCAAGTATTG 42 60 

TTGGGTAGAA TAGACACCAT TTACATAAGG AGTTGTCGAG TAATTTGGTC GCCAGAAATC 4320 

CATGATGTCA CGGGTCTGAG CTACATTGTC ATTATTAAAG GCCATCATGC GTGGATTTTG 4380 

TGTAATCAAC ATAGCTACAC TTCCAGCACC TTGAGTTGGT TCTCCTGGAG TTTCAATACC 4440 

GTATTTGGCA ATATCACTGG CAATGACCAA GACCTTGGAC TCCGGAGAAT TTTCCACATG 4500 

CAATTTGGCA TAATGGAGGG CAGCAGTCGC TCCGTAGCAG GCTTCTTTAA TCTCGAAACT 45 60 

ACGAGCAAAG GGCTGGATGC CCAGCAAGCC ATGCACAAAG ACGGCCGCAG CCTTACTCTG 462 0 

GTCAATTCCT GACTCGGTCG CCACAATGAC CATGTCAACT TCTTGTCTTT CTTGCTCAGT 4680 

TAAAATAGAG TCACTAGCAC TGGCCGCCAA GGTCACGATA TCCTCAGTTA GGGGCGCAAT 4740 

ACTCAATTCC TTGAGTAAGA GTCCTTTACT TAATTTTTCA GGGTCAATTC CCCTCGCTTC 4800 

TGCTAAGTCT TGTAATTTCA AGACATATTG ACTGGTCGCA AAACCAATCT TATCAATACC 4860 

GATTGTCATA TTTACCTCTG TTTTATCATT CATGTAAAAA ATCGTTCTAT ACTATTTTAT 492 0 

CACAAATGGC AGTAAAAGAG AGAAAAAAGA CTTGATTCAC CAAATCAAGC CTCTTATTGG 4980 

TCATCATTTT AAAGAATGAT TAGTTGCTAG AGAGTTCACC GATATAAGTA GCTTTATAAG 5040 

CTCCATTCAC AGTTATCAGC TCCTGGAGGA TCAAATTTCC TGAGTAAGTC CTTCCCATCT 5100 

CATCTACAAA TTTTTGATAA AACTGACTGG TCGGAATTTC TCTGACATCC TTATCAAATG 516 0 

TCTTATCAAG TGTTTTACTA ACCTTCTCAG CAATCAATTG ATGCTCTTGC CATCCACTTT 522 0 

GAAACTCTGA GCCCGAACTA GAAACCATGA CTGGGATAAA CAACAAGGTC AGTAGATTTA 52 80 

CAGACAATAA GGAAAGTAGT AGACTTCCTG CAAAACTAGA ATCCTAGTTC ATGATTGATA 534 0 

ATACCAGCAA TCAAATTCAT TCGTAATCCG AAGCGTTTAC GATGATTTCG ATAGGTTGTT 5400 

GAAAACATTT TAAACGTTTT TACTTTGGCA AAGATGTTCT CAACCTTGCT TCTCTCCTTA 5460 

GATAGCGCAT GGTTACAGGC TTTATCTTCA GCTGTTAGCG GCTTGAGTTT GCTGGATTTA 552 0 

CGTGGAGTTT GTGCTTGAGG ATATATCTTC ATGAGCCCTT GATAATCACT GTCAGCCAAG 5580 

ATTTTACCAG CTTGTCCGAT ATTTCTGCAA CTCATTTTGA ACAACTTCAT ATCATGACTA 564 0 

TAGTTCACAG CGATATCCAA AGAAACAATT CTCCCTTGAC TTGTGACAAT CGCTTGAGCC 5700 

TTCATAGCGT GAAATTTCTT TTTACCAGAA TCATTCGCTA ATTGTTTTTT AGGGCGATTG 5760 
AATCATTATC GTGTCCTCAA AGCTGAGAGG AGTTCTTGAA 5820 

AAGAGTTACT TCAACCCATT GGCTCCGACG GATTAAGTTG 5880 

AGCCGCAATT TCTTCATAAG TGCGGTATTC TCGCACATAT 5 940 

TTATCTCATT TTTCAGAAAA TTCTTTTATT TCTGTAAAGT 6000 

TTATATAATG ATAGAGTCTG AGAATCACTG TTCCGCTAGC 6060 

AAGCCAACAT GACAACCAAG GTCGCACTTG CCAGTGCTTT 6120 

AAAAGGCAGT TGTTCGGTAG GAGAGATAAC CTGGAACCAG 6180 

TAAAGACCAC AGCAGGTGTC TTATAAAGAA TACTTAAAAT 6240 

TAATGGCTGC AATGAAGGTA GCTACAATGA CATTGGTCGG 6300 

TTAGCCAGAC AGTCATGCCC AAAATCCCTC CAGGTAAGAG 63 60 

GTACGATTAA AAAAGTGATA ATGGCAAGAA AACTTGCTAC 6420 

TTAGTGTCAT ATTAGTTCAT CAATACCAAG GCGACAGAAG 64 80 

GTAATGAGCA GGGATTCAAA CATCTTACTC ATACCAGAGT 6540 

TCACGGACCG CATTGGTCAA GGCAATACCT GGTACAAACG 6600 

ATCAAATCTG CCGTTGAAGG AAAACCTGTG TAGCGAGCCC 6660 

AAGACAAAGG CTCCAGCAAA GGCTGTCACA AAGGGAATTC 6720 

AGGGAAAAGG CAAAACCAAA TAAGGTCGCC ACTCCTGCCC 67 80 

CCGCTAAACA TAACTGAAAA GAAAGGAGCA CTAAAGGTCG 6840 

TTAGTATAGG GAAGGGGTTT GGCTTGCAAG GCCGTCAATT 6 900 

TCAATCTGCC CCCCAACTAG CTGACGAGAA ATCTGGTTCA 6 960 

TTATAAGAAG AGGAGGTCAC GCGCTTCATG CGCAAATATT 7 02 0 

AGATAGCGGC AGGCATGGCA AGGACATTGC AATCCACAAT 7 080 

GAATCATGGT ATCTTCTACA CGATGGATTT CTGAGCCACT 714 0 

CTAGCATAAT CACATCAATG ACGGCATTTA ATTCTTTTGA 7200 

TTTATCAACT CCCTCTATTC TATCACAAAT CCGGACTCAA 72 60 

TCATGACAAA GATTGATTAC TCATTTTGAT TATCCATCTG 7320 

TTGTTTTAGG TTTGTAGATT GAAATCTTGA CTCTAGTCTT 73 80 

CTGCTCTAGG ACTTTGTTCA ACAACCATGC CTTCTGCACT 7440 

CTTCTACAAC TTCTATATTA GCTTCCTTAA TCCCAACAAT 7500 
TTGAATCAAA TTGTTCTTAG TAAACTCCAA GCTAGAACCA ATGTAACTCG GCATGGCAAC 7560 

ACTTGTAACT TTTTTAGCTA CTGTCAAGAC AATTTGAGTA GGTTTACTCA CATCATAAGT 7 620 

CGTTCCGGCA CCTGGACTTT GTTTCATAAT CGTTCCTGGT TCGCTTTCGC TGGACTCTTC 7 680 

TTCCTCTATC TTAATCAAAT TCTCAGGAAC CTTCTTCTGC TTGAGTTCTG AGATTACTTC 7 740 

TGTAGAGTTC CGTCCAATAT AGTTCCCTAA TTGAATCGTC GTAGCTTTTT TAGCTACTGT 7 800 

CAAAACAATT TGAGTTGCCT TGCTCAAGTC ATAGGTCGTA CCTTCTGGTA GACTTTGCTT 7 860 

CAGGACCGTT CCAGCCTCAC TCTCATTCGA CTCTTCTTCC TCAATTTTAA TCAAATTATC 7920 

TGGAACTTTT TTCTCTTTTA ATTCCGCAAT GACATCAGAG GATTTCCGAC CGACATAATT 7 980 

ACTAATTTGG AAAGATTGCT TGCCTGATGA GACAACCAAA TTGATTTTCG TTCCTTCTTT 8040 

TCGACCAGTT CCAGCGCCAG GATCTGTACG GATAATCCGC CCTTCTTCCA CCTTTTCACT 8100 

AGCCTCTGTC TTCTCCTCAC CAATCTCAAA ATTGGCTTTT TTGAGCGTTG CCTTGGCCTC 8160 

TGCAACTGTC TGACCTGCCA CATCTGGAAT GGCAATGGTT GCAGGAGTTC TGGATAGTAT 8220 

CCAAATAAGA GAAGCTGCCA CCAATACAAG GCTGGCCAAC AAAATCAGGT AACGCATCTT 82 80 

AAATCTATGT TTTTTCGGTG CTTGTGGTTG GTAAGTTTCC TCCGTCACAG CCTGGCTTGG 8340 

GTTTTTGATT GATTTGTGTT CTGTTTGCGC TTGAACCTTA GGAATAGATG TCAAGGTACT 84 00 

CTGAGAAACC TTCGGCAAGG TCTTGGTATC TGCCTTGCTC GTTTCATCAA AGATTAACTT 84 60 

ACTTTCATTT CTACGATTGT AGGACAAGCT ACTAGACAAG TCCACATACA TCTCTGAAAC 8520 

CGAGCGGTAG CGATTGGTCA ACTTTTTAGC AGTTGCCTTG ATAATAACAT TTTCTAAAGC 8580 

CTGAGGTACA GATGGATTTT CTGCAATAAC GGACGGCAGG GGTTTCTGGA AATGCTGGAG 8640 

GGCAATGGTC ACCGCGCTAT CCCCGTCATA AGGGATATGG CCTGTCAGCA TCTCATAGAA 87 00 

AATAATCCCC ATGGCATAGA TATCACTCTG CACAGTCGCC TTCGAACCAC GCGCCTGCTC 87 60 

TGGTGACAAG TAATGAACTG AGCCCAACAT CGAGTTAGTC TGGGTCAGAC TTGTCTCTGC 8820 

AAAGGCTACA GCAATCCCAA AGTCTGTGAC CTTGGCAGTC CCATCTGGTG TCAAGAGGAT 8880 

ATTTTGAGGT TTCAAGTCCC TGTGAACAAT TCCTCGAGTA TGGGCCAAGC GCATAGCCAA 8940 

GAGAATTTGT CCCATGATAC GGACTGCTTC TTCATTAGAA AGAGGATAAT GTTCCTTGAT 9000 

ATAGCGTTTG AGGTCCAGTC CAGCCACATA CTCCATAGCT AGGTACTGTT GACCGTCTTC 9060 

CTCGCCAATA TCTGTTATCC GAACGATATG AGGATGGTCT AGATCTGCCA TAGCTCTCGC 912 0 

TTCACGCTGA AAACGAGCTA CAGCTATCGG GTCCGTCTGG TAGTTGGTCC TCAGAACCTT 9180 

CACTGCCACT TCTTCCCCAT CTAAGATTAA GTCTTTGGCT AGGTAGACAT CCGCCATACC 9240 

TCCTCGACCA ATCTGTTTGA CAATCCGATA GCGTCCGGCA AAAATCTTGC CGATTTGGAT 930 0 
CATTCTGCAT CCTCCTCGTT CATAGAAACA AGGGCAACCG TAATGTTGTC TAAACCTCCT 9360 

GCATTGTTAG CAAAACGAAC AAGTGTCTCC GTTTTATCTG CTAAAGGAAT ATCACTGGTT 942 0 

ACAATATCAC GAATCTCACT GCCTGAAATC ATGTTGGTCA AGCCGTCACT ATTGAGCAAG 9 480 

AGATAGTCAC CTGACTCAAG GATAACTGTC CCAAAATCAG GCTGAATTTC ATCTTTTTGC 9540 

CCAATAGACT GGGTGATAAT ATTTTTTTGC GGATGAGCTT CTGCCTCTTC TGGTGTCAAT 9 600 

TGACCAGCCT TGAGCAATTC ATTAACCAAG GAATGATCGC TCGTCAACTG ATGGTATTCT 9660 

TCTCCACGAA TCAAGCCGAT ACGCGAATCA CCAATATGAG CATAGATAGC CTGATTATCA 9720 

ATAATAGCAA GGACTTCCAA AGTAGTTCCC ATGCCTCTGT AAGCTTCATC CTGACCAAGC 97 80 

TGGTGAATCT TTTGATTTTC AATTTCTAGG TAATGGGCGA ACCATTCACG CACTTCATTG 9840 

ACTGTATCGA TCTGGGTATC AACCCAAGCT ACACCCAGGT CTGTGACCGC CATTTCACTA 9900 

GCGATATTCC CTGCGCGATG ACCTCCCATC CCATCAGCTA AAATAATCAT GGTACGTCCA 9960 

GCTCTATTGA CATAGTGGTT GACATAGTCT TGGTTATTTG TTCGTTTCTG ACCAACATCT 1002 0 

GTTAATAATG AAATTTCCAT GTGTCAGTTC CTTCCTAATC CGATATCTTG CGAAATTGAC 100 80 

TGATGAAGAA TCCATCACTT CCATACAATT CAGGTGTAAT GAGGATACAG CCGTCTTTCA 10140 

TGATATCCTT ACATTCATGT TCTAGTTTTA CCTGCTCGAA CTCGGGATGA CTCTCTAAAA 102 00 

AGGCCTTAAC GACTTGAAAA TTCTCCTCTG AGACGATAGT GCAGGTGCTA TAAGTTATTA 102 60 

TACCACCTTT GCCTAGTATT TGACAAACAC TACCTAATAT TTCTAACTGA ATTTCCTGCA 1032 0 

AGGACGCGAA ATCTGCCGTT TCTTTATTGT ATTTGATATC TGGTTTTCGG CGCAAAAGAC 10380 

CGATTCCTGA ACAAGGAGCA TCCACCAAAA TCTTATCAAA GGAATCCTGG TCAAAAAACT 10440 

CATGCACCTT TCTGGCATCC AATTTTTGAG TTTGAACCCG ATCTTCAACT CCCAGACGTT 10500 

GGGCATTTTC TTGAATTAAA TCCAACTTGT GGTCGTACAA GTCCAGAGCA GTAACCTGAC 10560 

CTGTCGTAAG ATAAGAGGCT ATATGGGCTG TTTTCCCACC TGGAGCCGCA CAGGCATCAA 1062 0 

GCACTCGCTC ATCACCTTGT AAATCAAGCG TCGGAGCAAC CAGCTGACTG GACTCATCTT 10680 

GGATGGTAAT GGCTCCATCC GCAAACAAAT TATGCCCTGC AAAATGCCCC TGCTCCTTAA 1074 0 

CCAGACCAGT GGTTGCTAAA AGGGAATTAT TCGCCTCCAA CAAGGCTTGG ATTTCCTCTT 108 00 

TTCGACTTAG GTCTGTTACA CGAATACTGG CTTTGTTTCG CACTAACAGG CTTTCAAAGA 10860 

TGGCTTTTGC TCTCTCCTCT CCGTATTCTT CCTTGAGTTT GGCAACTAGC CAAACTGGGA 1092 0 

GAGAATAGGC AATGGAGTCA CGCTTGTTTT TTCGCTTGAT GCTAGCAATA TCTGGCCAGC 10980 

CTTCACGCAA GATACGGCGA AGGACAGCGT TGACCAATTT TTCACTGCCT TTTTTACGGA 11040 
GTTTGGCCAA TTCCACTGCT TCATTAACCA CAGCATGATC TGGAATCTTG TCCAAATAGC 11100 

GGAGTTGGTA GGCACTCATG AGAAGAAGGA CATAGAGCCA GCTGTCTAAC TGGTCTCTGT 11160 

CTTCGATAAA GTGGGATAGG TACCATTCCA GAGTCAGTTT ACGGGCTACC GTTCCATAGA 11220 

CCAGCTCGGT CACTAAGCCC TTGTCTGCTG CCAAAAGTTG ACTTCCCTTT AGATGCTTAT 11280 

TTAAGGCGAT ATTTGAATAT GCTTGGTTCA CAAAAACATC CTCTAGCACT GCTAGAGCTA 11340 

AACTTCTAGC CGTTTCTACT TTAGTCACCA AATCGTTCTC CTACAGTCAA TGTACGTCCA 11400 

ACTCCGTTGA GGAAGGAAGC AATGTCCATC TTAGGCTTAC CAGCTGGCTG CACTTGTTTG 114 60 

AGGGATAGAG CCCCTTCAGC CGTTGCGACA ATCAATTCTT TCTTGCCGAT AGAGAGAATC 11520 

TCACCTGGAT TTCCCTGACC TTCTACTGGT AGGGCTTCAT AAATCTTAAA GCGGTCGCCC 11580 

TTAAGGAAAG TATGGGCAAC AGGCCAGGGG TTCATTCCAC GAATTTGGTT AAAGAGTTGA 1164 0 

CGATTGGTTT TGTTCCAGTC CAGTTTTTCT TCCTCTGGCT TTATATTTGG AGAGAAGGTA 11700 

ACCTGACTCG TATCCTGCGG TTCAGGTTTG ATATCACCAG CAATATAGGC AGGCAGAGTG 117 60 

TCCAAAAGCA AATCACGACC AACTAGCGGC AA H'T'ITCAA ACAAGGTGCC AACATTGTCC 11820 

TCATCTGTGA TCGGAATGCT GCGACGAGAA ATCATATCTC CTGCATCCAT TTCCTTAACC 1188 0 

ATTTCCATGA TGGTCACACC AGCTTCCTCA TCCCCTTGAA TCAAGGCATA ATGGATAGGC 11940 

GCACCACCAC GGTGTCTAGG AAGGAGGGAG GCATGAACGT TGACAGCAAA GTCCATGCTA 12000 

TCAAGGAGTT TGCTTGGGAG AAACTGCCCA AAAGCAGCAG TCACAATTCC ATCTGCTCCT 12060 

AGCTTCATAA GATCTTCCAT CTCTGGACTT CCAGATAATT TTTCAGGTTG GTAGATAGAT 12120 

AGTCCTGCTT CCTTGGCAGC CTGCTTGACT GGGGTTTCTT GGATAACTTT TTTACGACCA 12180 

ACAGCACGGT CTGGCTGGGT CACAACGGCT AGAATTTCGT AACGGTCATC TGTCAAAAGT 12240

CCTTTTAAGA CTGTTGCTGA AAAGTCGGGG GTCCCCATAA AGATTAGTTT TGTCATATCT 12300

TCTCCTTCTT ATAAAAATTG CTGCGGCTCA TGGTCAATGC TGAGACGGAG CTCACTATTT 12360

TCCCGTTCTT GAGTCAAGGC TAAAACCTGG TTGAGGGTCG ACCCCAGCTC ATCTTCTAAA 12420

CGGTATTTAA TTAAAATCTG GTAATGATAG AGGTTGTGGG TACGGGCAAT CGGTTTTGGC 12480

GTTGGCCCCA GAATGGGACT GGTCTCTGAC AAGCCTGACC GCAAAATGTT CATGACTTCA 12540 

TAGGCACGTT TGAAAACCTC TTCTTCTTTC TTGTGAGAAA GGGTAATACC AATCGTGAAA 12600 

TAGTAAGGTG GATAGCCGAG TTGTCGTCTG ATTCCCATTT CATAGGCATA AAAGCCTTCG 12660 

TAATCTTGAT CCTTGGCAAA TCGAATAGCA TAGTGCTGCG GATTGTAGGA CTGTATCAAG 12720

ACTTGACCTG CCTTTTCAGC CCGACCTGAT CGACCTGCCA CCTGAGTCAA GAGCTGGAAG 12780 

GTTCTCTCAG AAGAACGGAA ATCAGGCAGA TTCAAGGCCG TATCCGCATT TAGAACTCCG 12840

ACTAGGGTAA CATTGGGAAA ATCCAAACCC TTTGCAATCA TCTGAGTACC AAGTAAAATA 12900

TCCGCTTCCC CTCGCCCAAA CTGGTCAAGC AAGGCTTGGT GACTGCCTTT CTTTCGAGTC 12960 

GTATCCACAT CCATCCTCAG AATGCGAGCT TGGGGAAAGA GTTCTGCTAG CTCATCATAA 13020

GCCTTCTGAG TTCCCGTCCC ATAGTAACGA ATACTGCGGC TCTTACAGTT AGGACAGACC 13080 

TGAGGAATAT CCTTCGAGAA ACCACAATAA TGGCAGTTCA TAGTCTTGGT ATCCATATGC 13140 

AAGGTCAGAG AAATATCGCA GTTGGGACAA GTATCCACCG TCCCACACTC CCGACACATG 13200

ACAAAGCTAG AATAACCACG GCGATTGAGC ATGAGAACCA CCTGCTCTTT TTTAACCAGA 13260 

CGGTCTTGGA TAGCCTCTAG CAAAGGAGGC GTAAAGTTTG ACGTCTCATT TTGTCCGATA 13320

TAGTCTCGAA AGTCAATCAC TTGAACCTCA GGGATTGTAG CCAAAGGATT GGCACGTTGG 13380

GTTAGACGTA AGTGTTGATA GACGCCTTTG CCAGCACGTG CCCGGCTCTC TAAGCTCGGC 13440 

GTTGCAGATC CAAGTACCAG AGTTGCTTGA TTATACTGAG CCCGTAAAAT AGCTACCTCT 13500

CTGGCATGGT AACGGGGATT GCTGTCCTGC TTATAAGCCG CTTCATGCTC TTCATCAATA 13560

ATCATGACAC CCAGATTTTT CAGAGGAGCA AAGATAGCAG ATCTGGCACC AACAACAACT 13620

TGGGCATCGC CACGCTCCAC CTTGCGCCAT TCATCATACT TTTCACCATT GGATAATCCT 13680

GAGTGAAGAA TGGCTACCTT GTCCCCAAAA CGTGCTATAA AACGCTCGGT CATCTGAGGA 13740

GTCAAGGAAA TCTCAGGTAC CAGCAAAATA GCTGTCTTGC CCTTATCCAG GGCACCTTGG 13800

ATAATCTGCA AGTAAACCTC GGTCTTCCCA CTTCCTGTAA TCCCTTGAAG TAGAAAGGGA 13860

GGTTGAGAAC TGCCAATAGA ACTCACAACC GCATCACGCG CCTGTCTTTG TTCTGGATTT 13920

AACTCCAAAG GTCTACTTGC TTCAATTCCT TCAAAATAAG CAGCCGAGCG TTGAACTTCC 13980

TTTTGGACTA TGGTAACAGC ACCTTGATCC ACAAAGAAGT TGACTTGCTC TCGCGAGTAG 14040

GACTCTAACA AGCTAGCCAA GGAAGCGCTC TCTGGATGAG ACAGCAGATA ATCTCTCAGT 14100 

TCCAACTTTT TCTTGGCACG TGTAGAAATC TCAACACCTT CTAATTGAGC ATGGTCAACC 14160 

TCATACCAAG ACTGGGTCTT GACCTTCTTT TGATCGACTG CCTGATATTC CAGACCAAGC 14220 

AGGCCTTTTC TAGTCAAACG CATCATTTCA GCTTCCTTGG CAAGGTCTAG TGAAGAAAAG 14280 

GCTAGCGAAT CTTCTGAACC AAACAGGCGC ACTCGTTCTT CCTGACTCAA GCCTTCCAGA 14340

GGATAGAGAA TCTTGTCATA GCTAGAATTC AGAAACCCTG GAAGCATGGC CTTGAGGATA 14400

GAGATTTTGT AGGAGAAGAC AGATTTGCGT AACTCCTCAG CCAGCCAGAG TTGTTCTGGC 14460

GTGAGAACAG GAGAAAAATC CAGCACCTCT GCAATATCTT TTAAATCTTG CTCCATCTCT 14520

TCTCCATCTG ATTGGGACTT CAAACCAAGA ACAATCCCTT GAATCAGGCG ATTACCCTTA 14580

CCAAAAGGCA CATGAACCCG CATCCCAACT TCCAGCATTC CCTCAAATTC CTCCGGAATC 14640

CTGTAACTAT AGGGCTGGTC CGTCTGCATC AAGGGCACAT CTACGATAAT CTTAGCTAGG 14700

GCCATCTTCT CACCTCCTCC TTGTCAGTAC ATTCTTGCAA TAGAAAAAAT AAGATTGAGT 14760

CCCCCCAACC TTAAATTTTT TCACCATCTT CTTTTTCTTT AGCAATTTGC TCTTTGATTT 14820

TCTTTTCTTC TTCTTCTTTG CGGCGTTTTT CTTCTTCGAT ACGGCGACGC ACTGCTTCAC 14880 

GTTTTCCTTC TGGATCTGGG TGAATTGTAA CGTTTCCTGA TTCGATTTCT TCTAAAGCGC 14940

GAAGAGTTGA TTTTTCAGAC TTGAAACCTT GAGTTGCTGG GGCACCTGCT TCCAATTCGT 15000 

GGGCACGTTT TGCTTCCAAG ATTACGAGTG AATATTTTGA AGGAACCTTG TCGAGCAAGG 15060

TATCAATAGA GGGTTTTAAC ATCATTTGCT TGTACCTATT TTCTAAATTT TATCGGGTAG 15120

TTGGAGATTT TGGTAACATC TCCTGATAGT GACCAATGAC ACGATCCACA CAGAAGTGTT 15180 

CTGCTTCAAT CACACATTTG ACACGTTCAG CAGCTAGGGG TACCTGATCG TTGACAATCG 15240 

CATAATCATA CTCACGCATG AGGGCAATTT CTTCCTTGGC CTTTTCGATT CGTTGGGCAA 15300 

TCACTTCTGC ACTATCTGTT CCACGACCTA CCAAGCGATC TTGCAATTCA TCCAAATCTG 15360 

GTGGTGTCAG GAAGATAAAG ACAGCATCTG GAACCTTTTT CTTGACCTGA AGAGCACCCT 15420 

GAACTTCAAT TTCAAGGAAA ACATCGATTC CCTTGTCCAA GGTTTCATTG ACATAGGTCA 15480 

GAGGAGTTCC ATAGTAGTTA CCGACATATT CTGCGTATTC CAACATCTGT CCTTGACGAA 15540 

TCAGCTCTTC AAATTCTTCA CGAGTACGGA AGAAATAGTC AACACCGTCC ACTTCTCCAG 15600 

GACGTTGTGC GCGTGTCGTC ATCGATACAG AATATTGAAA TTGGTTTTCA GAACTCTCAA 15660 

AAATCTCTCT TCTAACCGTT CCTTTTCCAA CCCCTGAAGG ACCAGAAAAA ACGATTAGTA 15720 

AGCCTCGGTC TGCCATTGTG TCTCCTTTTA GTCAATCTGT GAAATAACAT TTCTCTAGAA 15780 

TAATGGCAAA AAGCCAGATT ATCCTTTACA GTCTTTCTAT CTAGTGTAAC AAAAAAGCAG 15840 

TAATTTTTCA ACTGCTCTTT CTTATTTATT TAGCATAATC TACTGCACGA AGCTCGCGAA 15900 

TCACGGTTAC CTTGATATTT CCTGGATAAT CGAGATTGTT TTCAATTTTC TTACGAACTT 15960

TGTGAGCCAA GATTGTGACT TTGTCGTCCT TGATTTTTCC TGGATTGACC ATGATACGAA 16020 

TTTCACGTCC TGCTTGAAGG GCAAAGCTAG TTTGCACTCC TTCAAAGCCG TTAGCAATTT 16080 

CTTCCAAATC ATGGAGACGC TTGATGTAGC TTTCAAGAGA CTCACTACGA GCACCTGGAC 16140 

GGGCTGCGCT CAAGGCATCT GCTGCAGCGA CGATAACTGC TATCACGCTC TCAGCTTCAA 16200 

CATCTCCGTG GTGACTAGCA ATCGTATTCA CCACAACTGG GGGTTCCTTG TACTTACGGG 16260

CCAATTCCAT ACCGATTTCA ACGTGGCTAC CTTCAACCTC ATGGTCAATG GCTTTCCCGA 16320 

TATCGTGAAG GAATCCAGCA CGACGGGCAA GAGCCGCATT TTCACCAAGT TCGCTCGCCA 16380

TGATACCAGC CAACTTAGCA ACCTCAATCG AATGGCGCAA AACATTTTGT CCATATGAAG 16440

TACGGAACTG CAAACGTCCC ATAATCTTCA TCAAGTCTGG ATGAAGGTTT GGCGCACCAA 16500 

TTTCATAGGC AGCAGCCTCA CCGTATTCAC GAATCTTATT GTCAATCTCT TGACGGTTTT 16560 

TCTCAACCAA CTCTTCGATA CGAGCTGGAT GTATACGACC ATCTTTGAGC AACATTTCCA 16620 

TAGTCATACG GGCAATCTCA CGACGAATCG GATCAAATCC TGACAAGGTC ACCACTTCTG 16680 

GTGTATCGTC GATAATCACA TCGACCCCTG TCAAACTTTC AAAGGTACGA ATGTTACGAC 16740 

CTTCACGACC AATAATGCGT CCCTTCATAG TATCGTCTGG CAGATGAACT GTTGAGTTTG 16800 

TTGACTCCGC TACATATTCA CCAGCGATAC GTTGCATAGC TTGAACCAAG ATGTCCTTGG 16860

CCATTTTGTC AGAACGTTCC TTGACCTCTT GCTCAGCTTC GCGAATGCGA CTGGCAATCT 16920 

CCCTGGTCAA GTTTTCCTCT GTCTGAGCCA AGATAATATC TCGTGCTTCT GCCTGAGACA 16980

GCGCACCAAT ACGCTCTAGT TCTGCTTCTT TTTGTCTTTC GACTTCCTCT AATTGCTCTT 17040 

CACGCGCATC AAGGTTTTTC GCTCTATCAG AAATACTTTG TTCTTTTTGT TCAAGTGTTT 17100 

GTTCTTTACT CGTCAAATTG TCGTCCTTAC GGTCAAGGCT AGTAGCTCTC TCTGTCAAAC 17160 

GACTTTCGAT TTGTTTGAGT TCTTGACGTT CTGATTTGAA TTCAGCGTCC ACTTCTTCAC 17220 

GGTATTTTCT GGCTTCTTCT TTGGCCTCCA ATAGTGCTTC TTTTTTAAGA GACTTGCTTT 17280

CACGTTTGGC TTCATTAACA AGTAAATCCG CTTCACGCTO AGCTTGTCCA CGTAAATTAG 17340 

TTGCTTCTTG TTCAGCATTT AAAAGCATCA ACTCTGCAGC TTCCTGAGAT GATTTCATCT 17400 

TAGCTGAGAT GCTGACATAT CCAATGACTA AACCAATGAT GACGGCAAAA ACAGCAATCG 17460 

CAAGCGACAT GATTTCCATG TTTTTACCTC ATTTTATTGT TATTCCGAAT GACATACATT 17520 

CTTTTACATT CTACCATAAA AAAGTGATTT TCACAAACCT AAAATAGAAT ATGTTTTGAG 17580 

GAATTTGGAA CACATTTACC AAAATAAACT TGTTGTTTAG AAATAGTAGT TTAGTAGAGA 17640 

CTTGAGAAAA AGCCTACCTT TCAATAGACT TAGTAATGAT CTTTAAAGGA CAAGAAAGCC 17700 

ACGCTATCTC CATCCATCAT ATAAATCAAG CGATTTTCTG CATCAATACG CCGTGACCAG 17760

GCTCCTTGGT AATCATATTT GAGTGGTTCT GGTTTACCTA TTCCTGTAAA GGGATCACGT 17820 

TGAATATCCT TGATTAGTTT ATTGATTCTT TTTAACGTTT TCTTATCCTG ATTTTGCCAG 17880 

TAGCAATAAT CTGCCCAGGC ATCTTCTGTA AACTTGAGCA GCATTTCTTA CTCCTCAATA 17940 

ACATGGACCT GAGTACTTCC AGCACGAACT TGAGCCATTC CTCGCAAAAC CTTATCAGAA 18000 

AGTTCCTTAT TTTGAGCAAT TCTCAGGGTT TCTTGGATAC TATCCCACTC ACTCTTTGAA 18060 

AGGACTACAA TGTCCTCATC TGGATTTTTA TTGACCACCG TCAAAGGCTC AAATTCATCA 18120

TTTACCTTCT TCATGTAGTC CTTTAAATGA TTTCGGAATG TTGAGTAAAG GACTGCTTCC 18180

ATAACCATAC CTCGTTTTAG CTCTTTTCCA CTATTATACA CGAAAAGAAA GAAATTGTCA 18240 

GGAACTTGTA CAAGATTTTC TTTTCTATCT ATTTATACTC AATGAAAATC AAAGAGCAAA 18300 

CTAGGAAACT AGCCGCAGGC TGTACTTGAG TACGGCAAGG CGACGTTGAC GCGATTTGAA 18360

TTTGATTTTC GAAGAGTATT ATTCGTAAAA AATCTCAAAA AGCCTACCTT TCGGTAGACT 18420 

TAGTTTGTTT CTATTC 18436 
(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7001 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

ACGTAGAAAA ACTATTTCTA TCACAGATAA TATTCCGTAT GTTGTTGGAG GTATTGAAAT 60 

AAACGTCCTA GGTATCTTTC TCAGTCTATG TGACTTACAA GGGAAAACTC TTTTCGAGAC 120 

AGAAATTTTG AATGAAGATT ATCCTATTTC AGAAATCAAT TCCACCATTA CCAATATGAT 180 

AAAAACAGCT ATAGAGTACG TCCCTTTGGA AACAAAATTA CTTGGATTTG GCTTATCAAT 240 

ACCTGGACAT TATAACAAAG ACTCCGGAAG TATCATTACA AACAACCCCA TATGGGAATC 300 

TTTTAATTTA TTAAATGTAA TTAAAAGATT CAATTTTCCT TTTATTGTAA AAAATAATAT 360 

CGATTGTATG GCTATAGGAC AATACCTTTT TAATCCACAC AATACCCCCG ATAACTTTAT 420 

TTTCCTACAC GCTGGATTAG GTATTTACAC TTCCTTTTTC ACAAAAGAAA AAATAGGAGC 480 

CTCTAAAAAT CCTTATATCG GAGAAATTGG ACACACCATT GTCGAATTGA ATGGGCAATA 540 

TTGTGAATGC GGAAAAAAAG GTTGTTTACA AACATATATT TCGGATGCTT GGTTAATCAA 600 

ACACGCCCAA TTATTATTTA AAAATTCCCA ACTAACTGTA CTAAAAAGCC TTGTAAAGAC 660 

TGAAAAAGAC ATTCATTTAG ACACCCTTTT AACGGCTTAT AATTTAGGCG ACTCCGCTTT 720 

ACGTCAACAA ATTGATAAAG GAGTCAATTT ATTAGCCACT TCTATTGCAA ATCTCCTCCT 780 

CATCAATCCT GCTGATAAAA TCTATATCAA CAGTCAATTG CTTAATTATC AACCTTTCAC 840 

TCATGAAGTC AGGGATAAAA TCCAAGACCA GCTCCACTTC GTTCCCTTTA CTCGTAATAT 900 

AGAAATTGAA ATTTTACCTT ACAACAAACA TCGTGGAAGT ATAGGAGCTT GTGCATTAGC 960 

TATCGTCGCT TTTTTCATAG AACATAGCAA TGTATTACAA GATATTATTT CACCTTAATA 1020 

TATTAGAAAT CTATAGACCT GTTTAAATCA ACTATAACCT GTAGTAGATA TCTCGTATTT 1080

AGACAATATG AAAACAAGAC GACTTCCATA TAGGAAACCG CCTTCTCGCT ATGTTGAGTG 1140

ATTTATATTA AAATAACTTT TCTTCTAGCT GCATTTTATT ATTATAAAAA CATTCATCAT 1200

AACCCCCAGA ACTTAAATAA CAATTTTTAT TCAAGATACA TACTCCTAGA ATAAACTTTA 1260

TATGAAATTC TCATTTTTGT TTTTACAATT CTCCTTAGTT AAATCTTGTT TAATATATGT 1320

TTTACATATA GTATTTAGCG CCACATAGTA CTGAACTCTC TCCAAAAACG GTTATTCCTC 1380 

TTTGAATAGG GCGTTATCAC AAGAAAAGCA TCTCCACGTT TCAACTTCAT ATGGCTCAAA 1440

AACAATCAAT TGATGCTAAA ACCTGTACCT AGATGTTTCG GTTCATAAAA CCATGAAACT 1500 

GTAAAAGTGG ATGAAATTGA TAGCGATAGT CAAATCAAGA GGCATCATAA CTCTAAAAAG 1560

TCACAATATA TAAGTTCATC CTCGGAAAAA TATCATTCTA ATTGTTGAAA TGCCTACATG 1620 

AAAAGAAACG TCAAATGCTC ATGAAACAAC GAATACAGGT ATCAAAACTA TGACAAAACA 1680 

AATCCCTAAA TTTACTAAAG ACACTGCTCA ACTTTACACC TGTAAATGGT TGTTGTATAA 1740

TAAAGTTACA AAGATGTACG ACCACACTGT TGTAAATCAT AGTGTTCGCG AATATATTAC 1800 

TGATAGCATT TCTACAAATA CAAGTAAAGA GAGCGGATGA GATTCAAACG AAATATGTCA 1860

GTGCTTTGGC ATTCCTAGCC TTCATATCAT TTAAAGAATT CTATAGACAA AATTTTTTCC 1920

AATACAGACA CTCGTAACAA CTGCTTCATT TTTCTACCAA CATATTTAGG AACAGGATAA 1980

GATACAAGAG TATTAATCCA TAGCTCAGTT CTATACCAAT CTAAGACAAA TAAGCTAAAA 2040

AAACGATTGA TAATAAGCAA ATAGATTCCA AATTTTCTCT ATCTGCTCAT TTTAATAAAC 2100 

AATACTAGTG TAACTATCCT TCCAGTCAGA AGCTTGTCAA ATCACACCGA AAATTCTTCT 2160 

AAAATTTATC TCGTTAGGCA ATCAAGCAAA AACTCGACGA TAGTACAAAC ATTATCATAC 2220 

AGGATTGACT TCCTAAATTA TATACTTTAG TAAGGTTTTC GGATAAGAAA AAAGGTTCAT 2280

TTTACATTTC TAAACATTCT TTTCTAAGAT GAAAAACAGA ATTTTTCGAT TGTGATTTAA 2340 

AGCAACAAGA AGATTTTCAG TATCATCCTA TAGATACGAG CTAATTAAGA AAAACTACAT 2400 

TTTTGAATAT AAACTACAAT AATATAAACT AAATTTTATA GGAGGAAGAC AATGGATTGG 2460

TACGATTATA TGATACAGGC ATCCAAACAA TCACAATTCA ACGCAAGCCA TTGGTTTCGC 2520

TATTTGCGAA AAGTTATTTT TGAAGACTAT TCTTATTTAA CAAACCAAGA TGTAGAAAAG 2580 

TTGCTAGACT CCAAAGAACT AACCCGTTTT CAAAAAATTA GCTTGAAGTA TGCCTTTCAA 2640

GAGCATACTC CAACTCATAA ATATGTGATT TCATTAAATA AACCTGCTAA GTTAACCAAT 2700

GTTCAAAAAT TGATGGAGAA ATACAAACAT GGATAAAATG AAACCGGTCT TCCAAGCCCT 2760

AAATAAGGAA TTAATTCAGG AAAATCTGAC TTTAACAATT ATCTGTGTCG GTGGTTATGT 2820

CTTAGAATAT CATGGTTTAC GTGCCACACA AGATGTTGAT GCTTTTATGG CTCTATAATA 2880

TTTGTAGTGG GTAAATCCCC TATGGATATT ATGGAGCCTA TTTTTGTGTA GAAAAAAAGT 2940

CCCATATGAC CTATAATGAA AAGCGACAAA ACAACTCATT AGAAAGAATC ATATGGAACA 3000 

ATTACATTTT ATCACAAAAT TACTAGACAT TAAAGACCCT AATATCCAGA TTTTAGACAT 3060 

CATCAATAAG GATACACACA AGGAAATCAT CGCCAAACTG GACTACGACG CCCCATCTTG 3120

CCCTGAGTGC GGAAACCAAT TGAAGAAATA TGACTTTCAA AAACCGTCTA AGATCCCTTA 3180

CCTCGAAACA ACTGGTATGC CTTCTAGAAT TCTCCTTAGA AAACGCCGTT TCAAGTGCTA 3240 

TCACTGTTCA AAAATGATGG TCGCTGAAAC TTCTATCGTC AAGAAGAATC ATCAAATTCC 3300

TCGTATTATC AACCAAAAAA TTGCGCAAAA GTTGATTGAG AAGATTTCTA TGACCGATAT 3360 

TGCTCATCAG CTGGCCATTT CAACTTCAAC TGTCATTCGC AAGCTCAATG ATTCTCACTT 3420 

TGAGCATGAT TTTTCGCGTC TTCCTGAGAT TATGTCCTGG GACGTTGAAA CAGTCCGGGG 3480 

AGTGACTGTT TCAATCGGGA GATGGAGATG AGCTTTATTG CGCAAGATTT TGAAAAGCTC 3540

GATATCATCA CTGTTCTTGA AGGTAGAACA CAAGCTGTCA TCCGAGATCA CTTTCTTAAA 3600

TATGATAGAG CCGTCCGATG TCGCGTCAAA ATTATTACTA TGGATATGTT TAGTCCTTAC 3660

TATGACTTAG CTAGACAACT TTTCCCGTGT GCTAAAATCG TTCTTGATCG CTTTCACATT 3720

GTACAACATC TTAGCCGTGC TATGAGTCGT GTGCGTGTCC AAATCATGAA TCAGTTTCAT 3780

CGAAAATCCC ATGAATACAA GGCTATCAAG CGCTACTGGA AACTCATTCA ACAGGATAGC 3840

CGTAAACTCA GCGATAAACA TTTTTATCGC CCTACTTTTC GTATGCATTT AACCAATAAA 3900

GAGATTTTAG ACAAGCTTTT GAGCTATTCA CAAGACTTGA AACATCACTA TCAGCTCTAT 3960

CAACTCTTGC TGTTTCACTT TCAGAATAAG GAACCGGAGA AATTTTTCGA ACTTATCGAG 4020

GACAATCTTA AGCAGGTTCA TCCTATTTTT CAGACTGTCT TTAAAACCTT CCTCAAAGAT 4080 

AAAGAAAAGG TTATCAACGC CCTTCAACTA CACTATTCTA ATGCCAAACT GGAAGCGACC 4140 

AATAATCTCA TCAAACTTAT CAAGCGCAAT GCCTTTGGTT TTCGAAACTT TGAAAACTTC 4200 

AAAAAACGGA TTTTTATCGC TCTGAATATC AAAAAAGAAA GGACAAAATT TGTCCTTTCT 4260

CGAGCTTAGC TTTTTTTCAA CCCACTACAG TTGACAAAGA GCCGGAAAAA GGAACAGCCT 4320 

TAGCTTTCCT TTCATTTCTT TTTATTTCCC TCGTAGTAAA CGTGCTAGCT TCCACAAAAC 4380

AAACAGGATT CCCAGAAATG CCAGTACCAC TAGCCCACGG TACAACCATT GAGAGGTTGC 4440

AACACGCGAT ACAGATTGTC CTTCTTTCGT AAAAGCAACC CTCGCAACTG CAGCTGTTTG 4500 

TGGATCTGAT TTTTGATAAA CAGCGACTCG TTCAAAATTC ACTAATAAGC GTTTATTAAA 4560 

GGTAGGAATC GGATCGCAGG TTATCAAGGT CATGATATTT TTAGAGCTAA CCGATTCTAA 4620

TTTTTCCCAT TCCGACGGTA AAATAATCTC TGTGTCCATC ATCTGATATT CTACAATTTC 4680

CTGGCCATTA TCATAATAAA GAGCATCTCC AACTTTTAGC TGATCCAAAT GGCGGAAAAA 4740

GACATGGCTT GGCTCTGCAC GGTGCCCAGC AATCACTGAG CGAATCCCTG TACCATCCAG 4800

AGGCAGCGGT GTACCATCCA CATGAGCCAA GCCCATCCCT AAATGATGAT AATCTGCTCC 4860 

CAAATAAACC GGCTCCATGA TTTCCAAACT TGGAATAGAC AAGTAACCAT AGACTGCATC 4920

AGGGTCGTCA GACACTTGGT AATTGACCTC ATATCCCTCC GCCAAAAAAG GATCTACAAT 4980 

GCGATTTTGC GAAGCCAAGC GTTGATTGTA GGCGAGAGAA TGGTTCTGTT GTTCTTGGTA 5040 

CATTTCAGTT GTCATGGATT TCACAAATGT AGCATGACCT TTCACCTGTC CAAGAGACTG 5100 

CAACACCATC TGTCCAAAAC AATAAATAGG AATCAAACAG GCTACCAACA TCAACAAGTA 5160 

TCCCAATAAG GCTCGTAGTT TAGTCCTTGA CATGACGCCC CTCCAATTGC TTTTCTAGTC 5220

CTTTGACAAT CCGTCGATTA CGATACACGC GATACAGCAA GAGAAGGATG ACCGCCATCG 5280

CTCCTAGTAA TAACCACAAC CAGAATTGCC CACGCTCTCT CACCGCTCGA TTCCGCTCTG 5340 

CAATTGGTGC CGTATACGGA ATCCGCTTCC CACGTACCAA CAGACGATGA CTGTTAATCA 5400

TATACGGTGT ACAAGTCAAC AAGGTCGCAT AATCTTCCCC ATGTTGAATC AAGACAGGCT 5460

CAAAGTCATT CGGCTCCACC GTCACTATCT GATCCACTTG GTAGGCCAAC ACCTGATCTA 5520

AAACGTGAAG ATAAAAGATA TCCCCTTTTT TCATCTTATC CAATTGACTG AACAATTCTG 5580 

CCGTTGGCAA TCCTCTGTGA GCAGTGATCA CTGTATGGGT ATTTTCACCT CCAACAGGCA 5640 

GCGAAGCCCC TTCTAACAGC CCTGCCCCTT TCTGAAGAAT GTCCTCACTC GTTCCGACAT 5700 

ACATCGGAAT TTCCTGATCA ATCGCAGGAA TTTCCACATA GCCAATCCGC TCATGGACCT 5760 

TTAGCATATT GGCATATTCT GAGACGCCTT TCTTTTTCTC TTGCTCTGTA AAAGGATCAA 5820

GAATTTCAGA TGGTTTCAAG GTCGCATTGA AGGCTTGAGC CAAGCGCCAA CGCTCCTCAA 5880 

GTTCTGCCTT ATCCATCTGG GAAACCGTCT CATCAAACTC TTTAATAACC TCGTTTGACT 5940

CAATACGATA ATAATAACGA GACACCAATG GATATATCGC AACGGCGAAT CCTACTAAGA 6000

AAATCAGAAG AAGGATCAGC GGATGTTTCT TCTTTTTTGT GCCTTTTTTT CGTGAACGTC 6060 

TACTGTTGTC CATCCTCCAC CTTCACTTCC TTCCTTGCTG CTTTCAGCGC CTTCAAAGCC 6120

TTTTCCGGTT GTTTTTTCTT CTTGCGCAAG CGTCGAATAA TCCATAAAAG AATCACAATC 6180 

AAACCAACTG CCACATAAAA CAGGTAGCGA TAGAGATGAC TGAGTTTGTT TGCTGCAATA 6240 

AATTCTTCCT CAACCTCTGC TACGTACGGT ATCCGATGCC CCCGAACCAA TAGACGATGG 6300 

GTATTGATCA TGTATGGCGT ACAAGTCAGC AAGGTCACAT AATCATGACC TGGTACAATC 6360

AATAAATCAT CAAAGTTCGT CGGCTCAATC ACCTTTACTT GATCCACTTG ATAGGCCATC 6420

ACTTCCTTGA TATTGTGCAC ATAAAACTTA TCCCCAACTT TAAGTTTGGT CAAATCCGTA 6480 

AACATCTTAG CTGTTGGCAA ACCTGTATGT GCCGTAATCA CCGCATGGGT CGAATTGCCT 6540 

CCGATCGGCA GAGAAGTTCC CTCTAGATGC CCAGCCCCTT GCTGCAATAC CTCTTCAGCA 6600 

GTACCAGCAT AAACCGGCAA ATCCACGTCA ATAACGGGGA TTTCCACATG CCCCATCCGC 6660 

TCATGGATTT CTAACATACG TGCATACTCT GCTCGCCCTT TTTTCTTCAT TTCTTCCGAC 6720

CAAGGATCGC CACTCACTAC ATTATTCAAA GAGTCATTGA AGGCTTGTGC CAATTTCATT 6780

CGTTCATCAA TGTCAGCCTC ATCCAACGTT GCTTTTTCCT TATCAAAGTC AGCAATTTGT 6840 

TGATTTGATT CCACTCGATA ATACAAGCGA GACACCAGCG GATACGCCAT TACCGCCATT 6900 

CCAATGAAAA ATACCACTCC TAATAGGAGA TTATTTCGTT TTTGCTTTTT TGTTTTTACC 6960

ATTTTTATCA GCATCCCTTT ATCTTCAAAC TTCAGGGTAT C 7001 
(2) INFORMATION FOR SEQ ID NO: 89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10411 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 

GAGGGAGCTT AAGAAGTTAC CACCGTCCTC TAGCGCCTTA TCCGCATCAA AGTTAAGGTT 60 

GATATTTTTA AAACTGTCGC CAGCTTGTGA TACGATGCTT TGTTTAAGGT CATTTAGGGT 120 

TTTAGTGAAA TCTGCATTGC TGAGGATATC ACTCTTTGAG AGATTCAAGG CAAAATTGAT 180 

GATGATATTG ATCTGGTTTC CTGTTATGAC CTGATCAAGT TTGTAATTTT TTAAGGTATC 240 

TTCAACAATC TTGCGGATAT CTTCTTCTGT CAGATTTCCC TTACTTTCTT TAGCTTTGGC 300 

GAGTCCTGAC TTGATATCAG CTAGGGCAAC GTTTAATTTA TTAGCATCAT AGCCTGA1TT 360

GTCCTTGTTT TCAGCATTGA TATCTGACAA AGCTTTTAGC TCTTCTTGAG CCAAATCTTT 420 

ATTAGCTTGT GGCACCTTGG CTCCATTAGC CTCTAGCGAA TAGTAAATCC CTGCTAAAGC 480 

ACTTTCTCCT GTAACTGGAA TAGGGGCTGC TACAGTGATT TTGGCATGTT CCATACCCAG 540 

CGTTACTGCT GCGTTTCGGT ACATATCCTG AGTCACCTTA GTGATATTTT CTGGTGTTTC 600 

AATCTTGACC TCAAGTGGCG ATTTGTCACC TAGCTTTTGA ATCTTGGCTG ATGAATACAA 660 

CTGTAAGCTA GAGTCATTGG CCACATTCAT GATTTTAGAA TAAACATCAG GTGTCATGGT 720 

CTTGAGTTCT TTGGTATCTG TTGAGGCATT GTAGCCCAGT TTTTTAAGAG TTTGATTTTT 780

TTGGTCTTCA GATAGGGAGG AACCTAGGAC ATATTCAGGT TGGACATAGG TTTCATCGAT 840

AACTTTTTGA ACATCTGTTG CTGCATGGAC GCTATTCATA GCTGTTACTG CCCACAAGAT 900 

CGCAGCGCTA GTCAGAAAGA GTTTCTTTCT CATAGGGAAT TTCCTCCTTT ACTTCTTTAG 960 

AGTAATATAT CTATCTTAAA GAAAACTTAT AACAAAAACA CCTGGTCTAG CCAGATGTTG 1020 

AAAAGAGAGT GAAACATTTG ATGATGTAAA GGTTAAGTCG TACCTGTCTA GAATAATAAT 1080 

AGTTTCCTCC ATTTACATAG AGTTCAGCAC CGTGAAAAAT GGAAATGGGG TGAATATAAC 1140

TATAAGTCTT TCCAGTCCTA TTACCAAGCA AGGGGGCAAC AGTCTCACGA GAGTACTGTT 1200

TGGCTAGAGC CAGGGTATTT TCCTTGCCAT TTTGGGCGAT AAAATCGATA TAGGCAGGTC 1260 

CAAAATTATA GGCTTGAACA GCTGTCCAGA TATCTACCCC CTTCTTCTGC GCCAGATAGA 1320

GATTGCCTGT CAGAGTTTGA ATGCCTTGCC GAATGCTAGA GGCATTATCA TTGATGGTGT 1380 

TGGTGGAACC ACTTGCAGAC TCACTAGACT GCATAACATC GCCTTCTTTT CCTTTTGTTT 1440 

CAGTATAAAT CATAGCAAGC ACAAGCTCTT CGTTTGCTGG GGTGTCTTGT TCACTCAATA 1500 

TTTCTCGCAC CATGGGTTGA TAGGTCATGA CTTGTTTGAC ATCTTGATGA ACGCGGTAAG 1560 

CTTTATAGCC AGCAAAAAGG AAGACTGCTA GTACAAGCAC TCTTCGAATT CGTTTAAACA 1620

TTATTTACTT TGGATATCCT CGATATTTTT GATTAAGATA GAGTAGGTTC CATTTTCGTT 1680 

TTGGATAAAC TCAACAGACT CGGCGTCTTG ATAGACGTTA TTGGGAACGA TGAGCTCAAT 1740

TCCATTTGAT AAGGAGAGTT TTTGGTTTTC AAATTTCTTT AATTGGCGAC TGGCATCAAT 1800 

TTCATCAAAT TGAACAGGTT CTGGTACGGC TTCTTTGACT TGGTCAATAA AGCTCAAACG 1860 

AGCCGTCAGA TTGTTGTCAA AAAGGTCATT AGCCAATTTC TCAGGTGACA ATTCATTGCT 1920

TTCTTCTAGG TTGTTGAAAA TAGCTGATTT GACCTTGGAT TGAAATTGAA AATCATCTGT 1980

GTTAAAAGAT TTAGCAATTC TCTGGGCTGT TTTTTCCAGT TCCTTGATAG ATTTTTTAGG 2040

AGAAATCTTA GGAGCGACAG CAAGAAGATT ATCTGAAAAA TAGTTCAAAA AAGTCCCGTT 2100 

GTACTTGATT CGTTTTTCAA TCAGGTGATA CTTGCTACTC TGAAGATTGA CCACCAAGGC 2160 

CTCATCAGCT CCTGTTCCAA ATCCAGGCAG GTTATTCTGA GTTAGCTTGA TTGGATTATC 2220 

AACTTCTCCT CCGAGGTGGG TCAAGGTCTC CCGCAGGGCA ATTCGCAAGA AAGCGAAATG 2280

TTCTACACCT TCTTTAGAAA ATTGCACAAA AATCAAGTCA TTGGTCTTGA GATTTTCAGA 2340 

AATGCTAAAC TCCTCTTTCC AGAGATTAGC CAGCGTTACT GATGTCTCCA ACAAATCGTC 2400 

TGTAATATGA TTGAAGAAGG GATTTTCTTC TTCGAAAATC CCAGTCTTGG CTTCATCTGA 2460 

ATACACATGT TCAATTTTTT TACGCAGGTA TTCTTCGATT TTTGGAGTAA TATTGAGAAA 2520

CTTATCTGCT AAGAACAGTT CGGTATCATC CGGACTGAAC TGGTGAATAA TGGCTTTCTT 2580

AATATAAATG TCCATAAAAG TTTTAGTCCT CGTATAATGG GAAGGCATCT GTCAATTCTT 2640

TGACTGCACT TCTCACTTCT TCTAATACAG CCTCATTTTC TGAATTCTTA AGGGTTTTAA 2700 

TGATGAGTTC AGCCACTTTG CGACTTTCTT CTTCACCAAA TCCACGTGCA GTAATGGCTG 2760

CTGCTCCGAT ACGAATCCCA CTTGTCTTGA ATGGTGACAA GCTTTCGTAA GGGATTGAGT 2820

TTTTATTTAA GGTAATATTG ACTTCATCCA ACAAGTTTTG AGCAACTTTG CCGTTTTCTA 2880

CAACTTTAGT CACATCAACA AGGAAGAGAT GGTTTTCAGT TCCACCTGAA ATAATACGGA 2940

AATCAGGGTC TTGCAAGAAG ACATCTGCCA TAGCCTTGCT GTTCTTAATT ACATTGGCAG 3000 

CATATTCCTT GAAGGCTGGA TCCAAAACTT CTTTGAAGGA AACTGCCTTA GCCGCCACAA 3060

CATGCTCTAA AGGACCGCCC TGAATACCTG GGAAAATAGC TGAATTGATT TTTTTAGCAA 3120 

GTTCTTCGTC ATTGGTCAAA ATCAAACCAC CACGAGGTCC ACGAAGGGTT TTGTGGGTCG 3180 

TTGTTGTTGT GATATGAGCG TATGGAACTG GGCTTGGATG AAGGCCAGCC GCAACCAAGC 3240

CAGCGATATG GGCCATGTCC ACCATGAGCT TCGCACCGAC AGCATCTGCG ATTTCACGGA 3300 

ATTTTGAAAA ATCGATAATT TGAGAATAGG CTGAAGCACC AGCTACAATC AGTTTTGGTT 3360

TTACTTCTTG GGCTTGTTTC AAGATAGCAT CAAAGTCTAA GAGTTCCGTT TTAGGATCAA 3420

CACTATAAGA AACAAAGTTG TAGGTTTGAC CAGAGAAGCT AACAGGAGCC CCATGAGTCA 3480

AATGACCACC TGATGCCAAA TCCATTCCCA TAACCGTATC ACCTGGCTCA ATCAAGGACA 3540 

TGTAAGCCGC ACAGTTAGCT TGGCTTCCTG AATGTGGTTG AACATTGGCA AATTTAGCAC 3600

CGAAAATTTC TTTTGCGCGT TCAATAGCAA GAGTCTCTAC AACGTCTACT ACATCAGTTC 3660 

CACCATAATA ACGGCGTCCT GGGTAACCCT CGGCATATTT ATTTGTCAAG ATAGACCCTT 3720 

GAGCTGCCAT AACAGCCTTG GAAACTACGT TTTCCGAAGC AATTAACTCG ATATTATTTT 3780

GTTGGCGTTC TTCTTCTTTG GCAATAGCAT TCCAGAGATC AGCATCATAT GCTTTAAAAT 3840

CATCTTTGTC AAAAATCATA GGTCTTCTCC TTTATTGTGT GACTAGTCCA TTAGTTTGAT 3900

TTTACAATAA GAAAATCAAA CTAACAGATG CGAATAAACC GTTTCTGCAT TTTATCACAA 3960

GTATAGCCAA CTTTTTCATA AAATGCATGA GCACCCAGAC GATGATTGGC AGAATTTAAG 4020 

CGGATAAACC CATAACCACA TCTTTTTGCT TCTTCTTCCA ACCCTTGTAG TAAACTTTTA 4080 

CCAATACCTT GACCTTGCGC TTGAGGTGAA ACTGCTAAAG CTAAGATATT AAATCCTGCT 4140 

TTGGAATAGA GTGATTCGTA AACTTCAGCG TGGACATATC CAAGTAAGAC ATGATTAGCT 4200 

GCATCCTCAT AGCCAAGTAG GAAATGATGG GAATCCTGAG ACAGTCTAGC TAGTTGGCTA 4260 

GCCGTTTCCT CTGGACTAAA AGTATAACCC AAAGCCTCTT GGTTGATGTC ACATATAGCT 4320




TTCACATCAG TTTCTCTTAA ATCTCTTAGC ATCTCATTCC TCCTCAAAAG AAATCTTTGG 4380

CAACCGAGCA AGAATATCTT CTCGCTTAAT GGCCCCTTGA CGTAAGATTT TCACCTTGTC 4440 

TCCCGACAAA TTCAAAATAG TTGAATCCTG TCCAGTTAGA AAAGCATCGT CTTCCAGACC 4500 

CAGAACCTCT TGGTCAAAAT CCTCTAGAAT TTGATTAAAG GTCACTCCAC TCGCCTGACC 4560

TGAGATATTG GCAGACGGCC CAATCAAGGG ACCTGTCTCT CGAATCAAAT CAAGGGTAAT 4620

GGGATGACTA GGCATCCGAA ATCCAACAGT TGCAAGGCCA GAATTGACCC AATAGGGAAC 4680

TCGGTCATTA GCTTCGAGAA TAATGGTCAA GGGACCTGGT AAAAAGATCT CTACAAGTTT 4740

TTGAAGATAA GTTGGCTGAT TCTTTGAAAA GTACAAGATG TCCTCTAAAG AGGCAACATT 4800 

GAGATTGAGC GCCTTGTCTC TACGTCGACG TTTAAGCTGG TAAACATGGT CAACTGCTTT 4860 

TTCGTCTAGC GCCTTAGCAA AGAGACCGTA AACTGTCTCT GTAGGCAAAA CGACAGCTCC 4920

ACCATTTTCC AACTCTTGTC TAATCCTGTC CATCATCAAC GACAACCATC CTATCTTGAC 4980 

CAAATTGGTC CTTGAGTGTT CGTACTCGCT TTTCAGGAAG ATGTTTCCTA AAAAGTTCAG 5040 

GAACACTTTG ACCTTGCTTG TATCCAATTT CAAGGTAAAT CTTACCACCA TCTTTGAGAT 5100 

AGTCTTTTGC ATCTTCCGCA ATTCTACGGT AAATAGCTAG GCCATCCTCA TCTGCAAAGA 5160

GAGCTAGATG AGGCTCCGAA TACAAGACAT TCAAGCCTAC CTCTGACTCA TCTTCACGAG 5220

AGATATAGGG TGGATTGGAA ACAATTATAT CATATTTTTC AGAAATTTCT GTAAAACAGT 5280

CAGATTTTTT TAAAAATATT TGAAGATTTT GATTTTTAGC ATTTTCGCTA GCTACATCTA 5340 

AAGCATCTTG GGAAATATCT GCTGCCGTCA CTGACCAATC TGGTCTGTTT TTTGCTAGAG 5400 

CGAGAGCAAT AGCTCCACTA CCTGTTCCGA TATCTAGGAC CATAAGATTT TTCACAGGAT 5460 

TTTCAGCCAG GATAAGCTCC ACCAACTCCT CTGTTTCTGG ACGAGGAATC AAAACCCGTT 5520 

CATCCACCTT TAAATGCATT CCATAAAAAT CTGCCTGTCC AATGATGTAC TGAGCTGGCT 5580 

TGTGAGCTGC TAGTTGCTGG TAAATATCTT CTACAAATTG TTTTTCTTCC TCTGTTGTCA 5640

CCTCCTGCTG GAGGGCAAAA ATAAAGTCTG TAAAAGATAG ATTTTTCAGA CTACGATAGA 5700 

CAAAAGAGAG GCTTTCCGCT TCCTCTCCTT GTCTTATCAA CTCTTCTTCA AAATTTGAAA 5760 

ATAATTGAGC TAATTTCATT ATTTGTTTAA TTCTTCTAGT TTTTGTGTTT GGTCATAAAG 5820 

CACCAAGGCA TCCACAACTT CGTCCAATTT ACCAGACAAA ATCGTATCTA GTTTTTGGAG 5880 

GGTCAAGCCG ATACGGTGGT CTGTGACACG GTTTTGTGGG AAGTTATAAG TTCGGATCCG 5940

TTCTGAACGG TCACCAGTAC CGATTGTCGA CTTACGCTCA GCGTCCTGCT CATCTTGAGC 6000 

AATCTGAGCA AAGTGGTCAG CAACACGGGC ACGGATGATT TTCATGGCCT TCTCACGGTT 6060

CTTCTGCTGG GTACGTTCTT CCTGCATCTC AACCTTGATA TTGGTTGGCA AGTGAACGAT 6120

ACGAACGGCA GTCGCAACCT TATTGACGTT CTGTCCACCA GCACCAGAGG CGTGATAGAT 6180 

GTCGACACGA AGGTCTTTTG GATCAATGTC GTATTCAACC TCTTCAACTT CTGGCATAAC 6240 

AAGAACTGTC GCTGTCGAAG TATGAACACG GCCTTGGCTT TCTGTCACAG GAACACGTTG 6300

CACACGGTGG GCACCTGATT CATACTTAAG CTTAGAGTAT ACAGACTGAC CTGAAACCAT 6360

AGCAACCACT TCTTTAAAAC CACCGACACC ATTCATAGAG GCTTCCATGA CTTCAAAGCG 6420 

CCAACCTTGG GCTTCCGCAT ACTTTTGGTA CATAGTTAGC AAATCTCCAG CGAAAAGTGC 6480

CGCTTCGTCT CCACCAGCTG CTCCACGGAT TTCAAGGATG ATATTCTTGT CATCGTTTGG 6540 

ATCCTTTGGA AGGAGCAAAA TTTTCAGTTT TTCTTCATAT TCTTCTTTTT CAGCCTTGGC 6600 

ATCTTTGAGT TCTTGCTTGG CCAATTCTTC CAAGTCCGCA TCTCCGCCTG ATTCCTTAAT 6660

CATCTCTTCG GCATCGACGA TATTTTGAAG GACTTGTTTA TACTCACGGT AGGCTATTAC 6720

GGTGTCACGA TTGGAAGCTT CTTCTTTTGA AAGCTCCATA AAACGCTTGG TGTCTGAAAC 6780

GACATCAGGG TCACTCAGCA ATTCTCCTAA TTCTTCATAA CGGTCTTCTA CAACTTGTAG 6840 

TTGATCATAG ATGTTCATTT TTTCTCCTTA TTTCTCAATT GTTAAATCAT AGATTGCTAC 6900 

TACTTCATTC TCGGATATTT CCCCAGTTTC TTTAAATCCA TAACTGAGGT AACAAAATCT 6960

TGCCTGTTCA TTTTCTGGTT CATArGACAA CCAAAGTTTA TTGCTTAAAC CTGCTGGCGC 7020

TGTTCGAACA TAGTCTAGTA CTTTATCCAT AATTGGTTTA AAATATCCTT GATTTTGAAA 7080 

ATTCTTATCA ATCATAAAAC GAAATAGTAA ATAATTTCCA CTACTAATTC CGATCTTTTT 7140 

ATCATAAGCT ATCATCACAA AACCTATAAT TGCATCATTA TCATAAACTG CCAATGGAGC 7200

TACAAAATCT CCATTTTTAG TGTAGACGTA TGCTTCAGCT AAACTAATTG CGTTGGTTGC 7260

AATGAATTGT TTTTGATATT CCTTGACATC CAAATTTAAA ACATCAAAAT AATTTTCCAT 7320 

TGTAACATCT CTTAGTTCAA TTGTCATAGT TTTGCTCCTT GTTAGAGGTT ATCATTGGCG 7380

CAAAATAATG TTTACGGCAA ACTGAGATAT AGGTTTCGTT ACCACCAATC TGGATCTG'l'T 7440 

CTCCATCGTA AACGGGCAGT CCATCCTGTG TTCGCAACAC CATGGTCGCC TTTTTCTTGC 7500 

AATACTGACA GATGGTCTTG ATTTCGTCAA TCTTGTCTGC TAAAAGCAAG AGATATTTGG 7560 

AACCTTCGAA CAATTCATTG CGAAAGTCAT TTTTCAAGCC AAAAGCCATG ACGGGTATGT 7620

CTAACTCGTC CACAACACGA GCTAGGTCGT AAACATGGTG GCGTTTGAGA AACTGGGCTT 7680 

CATCGACCAA AACACAGTAA GGTTTTTCTG GTAGGTCTCG GATATAGCCA AAGATATCCG 7740 

TTGTTTCCTC AATCGCAAgG GCAGGGCGTT TCATGCCAAT TCGACTCGAC ACATAGCCAA 7800 

CGCCGTCACG CGTATCCAGA GCCGAGGTCA TAATCACAAC ACCTTTTCCT TGCTCCTCGT 7860

AGTTATAGGC CACTTTGAGA ATCTCAATCG TTTTACCAGA GTTCATGGTC CCATAACGAT 7920

AGTACAACTG TGCCATGTTT CTTGCTTCAC GTCCATTTCT AAATTTTTGC TACATTCTAG 7980

TATATCATAA TTTTCTTAAG CTTTAAACGG CAAAATGTGG TAAAATAGAA GAAATCAAAA 8040

ACTAGTGGAG GAAGCTATTA TGCCATTTGT ACGCATCGAT TTATTTGAAG GACGCACGCT 8100 

CGAGCAAAAG AAAGCTCTTG CTAAGGAAGT AACGGAAGCA GTTGTCCGCA ACACTGGAGC 8160 

CCCTCAATCT GCTGTCCATG TCATCATCAA CGACATGCCA GAAGGAACTT ACTTCCCACA 8220

AGGGGAAATG CGTACTAAAT AAGCTAGCTT AAGCAGAATT GCTTAGGCTT TTTCAATCTC 8280 

CAAGTAGCAT TCATTGAAGA AATATCCTAA ATTTGTTACA ATTTGAAAAG AAACTTGGAG 8340 

AATTTCCAAG AAAAGAGCTA TTAATTAAAG GAAACATTAT GATTACACGT GAATTTGATA 8400 

CCATCGCTGC TATCTCTACT CCACTAGGTG AAGGGGCTAT TGGTATTGTC CGCCTGAGCG 8460

gAACAGACAG TTTTGCTATT GCGCAAAAGA TTTTTAAAGG AAAAGACTTG AACAAGGTTG 8520

CCAGCCACAC TCTCAACTAC GGTCACATTA TTGATCCTCT GACTGGTAAA GTCATGGACG 8580 

AGGTTATGGT TGGGGCTATG AAGTCTCCAA AGACCTTCAC TCGTGAGGAT ATTATCGAGA 8640

TTAACACCCA CGGTGGGATT GCGGTGACCA ATGAAATTCT CCAGCTAGCT ATTCGTGAAG 8700 

GGGCTCGGTT GGCAGAACCT GGTGAATTTA CCAAACGTGC TTTTTTAAAC GGTCGCGTAG 8760 

ACTTGACACA GGCAGAGGCT GTGATGGATA TCATCCGTGC CAAGACTGAC AAGGCCATGA 8820

ACATTGCGGT CAAACAATTA GACGGCTCCC TTTCTGACCT CATTAACAAT ACCCGTCAAG 8880 

AAATCCTCAA TACACTTGCC CAAGTTGAGG TCAATATCGA CTATCCTGAG TATGACGATG 8940 

TTGAGGAAGC CACTACTGCT GTTGTCCGAG AGAAGACAAT GGAGTTTGAG CAATTACTAA 9000 

CCAAACTCCT TAGGACAGCA CGTCGTGGTA AAATCCTTCG TGAAGGAATT TCAACGGCTA 9060 

TCATTGGACG TCCCAACGTT GGGAAATCAA GCCTTCTCAA CAACCTCTTG CGTGAGGACA 9120 

AGGCTATCGT AACAGATATC GCTGGGACAA CACGAGATGT CATCGAAGAG TACGTCAACA 9180 

TCAATGGTGT ACCTCTCAAA TTGATTGATA CAGCCGGTAT TCGTGAAACG GATGATATCG 9240 

TTGAACAAAT TGGAGTTGAG CGTTCGAAAA AAGCTCTTAA GGAAGCTGAC CTAGTTCTGC 9300 

TAGTACTAAA CGCTAGTGAA CCACTAACCG CCCAAGATCG CCAACTCCTA GAAATCAGTC 9360

AGGAGACTAA TCGCATTATT CTTCTTAACA AAACTGACCT GCCTGAAACG ATTGAAACTT 9420

CGGAACTACC TGAAGATGTC ATCCGCATTT CAGTTCTTAA AAATCAAAAC ATCGATAAAA 9480

TCGAAGAGAG AATCAACAAC CTCTTCTTTG AAAATGCTGG TTTGGTTGAG CAAGATGCTA 9540 

CCTACTTGTC AAACGCCCGT CACATTTCCT TGATTGAGAA GGCCGTTGAA AGCCTACAAG 9600

CTGTTAACCA AGGTCTTGAA CTAGGGATGC CAGTTGACTT GCTTCAAGTT GACTTGACCC 9660

GTACTTGGGA AATTCTAGGA GAAATCACTG GAGATGCTGC TCCAGATGAA CTCATCACCC 9720 

AACTCTTTAG CCAATTCTGT TTAGGAAAAT AAGAAAAATC CATGATCCTT CATTCGGTCA 9780

TGGATTTTAG GTTCTATAAT ATTTGTAGTG GGTAAATCCA CTATAGATAT TATGGAGCCT 9840 

ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA AACAACTCAT 9900 

TAGAAAGAAT CATATGGAAC AATTACATTT TATCACAAAA TTACTAGACA TTAAAGACCC 9960 

TAATATCCAG ATTTTAGACA TCATCAATAA GGATACACAC AAGGAAATCA TCGCCAAACT 10020

GGACTACGAC GCCCCATCTT GCCCTGAGTG CGGAAACCAA TTGAAGAAAT ATGACTTTCA 10080 

AAAAACCTTC TAAAATTCCT TATCTTGAAA CGACTGGTAT GCCCACTAGA ATTCTCCTTA 10140 

GAAAGCGTCG ATTCAAGTGC TATCACTGTT CAAAAATGAT GGTCGCTGAA ACTTCTATCG 10200 

TCAAGAAGAA TCACCAAATC CCTCGTATCA TCAACCAAAA GATTGCTCAA AAGTTAATTG 10260 

AAAAGATTTC TATGACTGAT ATTGCCCATC AGCTTTCCAT CTCAACTTCA ACTGTTATTC 10320

GTAAGCTCAA TGACTTTCAC TTTAAACATG ATTTTTCTTG TCTTCCTGAG ATTATGTCTT 10380

GGGATGAGTA TGCTTTTACA AAAGGGAAGA T 10411 
(2) INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2393 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

GTTTTGGGTT CTGGAAATTA TCAGATGGTT GGAAAAGCCG TCCACATCAA GATAGTGTTC 60 

GGAGATTTAA GTTTAAATTG AAGAAACTAA CACAGAGGAA ATGGAGTATA GACCTAACAA 120 

GACGTATTGA GCAACTGAAT TTGTCTATTC GAGGATGGAT AAACTATTGC TCATTGGGAA 180 

ATATGAAAAG TATAGTCGCC AGCATAGATG AGCGCTTGCG TACTCGCCTA CGAGTGATTA 240 

TCTGGAAGCA ATGGAAGAAG AAATCGAGAC GATTATGGGG ATTGCTTAAG TTAGGAGTTC 300 

CTAAATGGAT AGCAGATAAG GTATCTGGCT GGGGCGACCA TTATCAATTA GTAGCTCAGA 360

AGTCGGTACT TAAACGTGCT ATATCAAAAC CAGTCCTGGA AAAACGTGGA CTGGTTTCGT 420 

GTTTGGATTA TTACCTTGAA CGACATGCGT TAAAAGTTAG TTGAACCGCC GTATGCCAAA 480 

CGGCACGTAC GGTGGTGTGA GAGGGGCTAG AGATTATCCC CTACTCGATT AACTCCCCTG 540 

AAATTTATTT TAATTATGCA AATTTCACGT ATTTTTGATG CTGAGACGAC GATCCTGGGA 600

ACTTTTCAGA TATTTTTTTG ACTATCTAAA TCTATCATTA GAAAAGCTTA GAGCGCCAAA 660

GGATTTGAGC GTTTTTCTGA TTTTTAAGAC TTTTTCCAGT CTCTTTTTCG ATTGAAGATG 720

TAATTATTCT ACTAACTAAC TAACTTCTTA GTACTAGCCA ACAACGATAA TCATAATTCC 780

TCCTAAAATT AGGAATAATA AAGGCAATAG TTTTTGTTTT TTCATGTAAA AAACCTCACT 840

TTTGTTTTCT GCTATTTTAT GCTAAAATAT TAAAAATCAA ATTTAATTCC AAAGTTTGTA 900

ACTAAAGGGG GAGCGCTACA TGTCTAATTC ATTTGTCAAG TTGTTAGTCT CTCAATTATT 960

TGCAAATTTA GCAGATATTT TCTTTAGAGT AACAATCATT GCTAACATAT ACATTATTTC 1020 

AAAATCAGTA ATTGCCACAT CACTAGTTCC TATCTTAATA GGAATATCCT CTTTTGTTGC 1080 

GAGTCTTTTA GTTCCGTTGG TTACTAAAAG GTTAGCGCTA AATAGGGTTT TATCTTTATC 1140 

TCAATTTGGA AAGACTATAT TATTGGCGAT ACTGGTAGGA ATGTTTACCG TAATGCAATC 1200 

CGTAGCGCCT TTGGTGACCT ATCTATTTGT TGTTGCAATT TCCATACTAG ATGGTTTTGC 1260 

AGCACCCGTT TCCTATGCTA TTGTGCCACG CTATGCGACC GATTTGGGTA AGGCTAATTC 1320 

AGCCTTATCA ATGACTGGTG AAGCTGTTCA ATTGATAGGT TGGGGATTAG GTGGACTCTT 1380 

GTTTGCAACA ATTGGTCTGT TACCTACCAC GTGTATCAAT TTAGTCTTGT ATATCATTTC 1440 

TAGCTTTCTG ATGTTATTTC TTCCTAACGC TGAAGTGGAG GTGTTAGAGT CAGAAACTAA 1500 

TCTTGAAATT TTGCTCAAAG GTTGGAAGTT AGTTGCTAGA AATCCTAGAT TAAGACTTTT 1560 

TGTATCAGCA AATTTATTGG AAATTTTTTC AAATACGATT TGGGTTTCTT CCATTATACT 1620 

TGTTTTTGTA ACGGAGTTAT TAAATAAAAC GGAAAGTTAC TGGGGATATT CTAATACAGC 1680

ATACTCTATT GGTATTATAA TTAGTGGCTT AATTGCTTTT AGGCTATCTG AAAAGTTCCT 1740 

TGCTGCTAAA TGGGAACCCC AATTATTCAC CCCAAATCTA AAAACCATCC AGAATCCTTG 1800

CCTTAGCTTA GATCCTGGAT GGTTTCTTTT TTCACCCAAT GGGTGTTTTT TACTAGACAA 1860

AAAAGAGTTT CCCCTTTATG GTATAAGTGT AGAAAAAAAC ACAAAAAGAA AGGAAACTCA 1920 

CATGAACAGT TTACCAAATC ATCACTTCCA AAACAAGTCT TTTTACCAAC TATCTTTCGA 1980

TGGAGGTCAT TTAACCCAGT ATGGTGGTCT TATCTTTTTT CAGGAACTTT TTTCCCAGTT 2040

GAAACTAAAA GAGCGGATTT CTAAGTATTT AGTAACGAAT GACCAACGCC GCTACTGTCG 2100 

TTATTCGGAT TCAGATATCC TTGTCCAGTT CCTCTTTCAA CTGTTAACAG GTTATGGAAC 2160 

GGACTATGCT TGTAAAGAAT TGTCAGCTGA TGCCTACTTT CCAAAATTGT TGGAAGGAGG 2220 

GCAGCTTGCT TCACAGCCAA CCTTATCCCG TTTTCTTTCC AGAACTGACG AGGAAACAGT 2280

CCATAGTTTG CGATGCCTCA ACCTTGAATT GGTCGAATTC TTTTTACAGT TTCACCAGCT 2340

AAACCAACTC ATTGTAGATA ACGATTCTAC CCATTTCACA ACTTATGGCA AGC 2393
(2) INFORMATION FOR SEQ ID NO: 91:

(i) SEQUENCE CHARACTERISTICS:



(A) LENGTH: 4762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91: 

TTTGTATCTT TTTAGGTCTC TTTCAATCCA AACCCTTTAA ACTATACGTC ATTTCGGTTC 60 

CTGCAAGTCT TGTGGTAATT TTAGGTTTGA TTTTACTTTT CTTTTCACAA GAGCCTCTGC 120

ACGCTTCTTA TTTGATGGTC GTCTTCCCTG TTTTCCTACT TTTATTGGTA ACCAATATTA 180 

AGAGTCAACA GAGGGGGCGT AGTGCTAGAA GAAGCCGAAG AGAAACGCCA TTATGCCTAT 240 

GGAGTCGTTT CTTCAAAGGA AATCTATATC TGCTAGTTTT TGGGTTTGTC TATCTTTTGT 300

CTGTTCCTTT TTTGATGAAG TTTGTCCTTT ATCCAGTACC TTATCAAGAA CGTAATCGTC 360

TTGCTGATTT GGTAAAAGAG GAGACAAATA CGGAAGATGC TATCTCATGC ATGGGATGAT 420

ACTGCGACTC TTTATCGTAA GAGTGAGCGC TTGTCCCATC GGCGATTTTG TCCCCGTTGC 480 

ACTATACAGC AACTGAGGAA AATCGTAATA AGTTACTTAA TGACTTGAAA GAAAAACAAC 540 

CTAAGGTGAT TGTGGTAAAT GATAAGGTGG TAGTCTGGTC TGAAGTGGAA ACACTCTTAA 600 

AAGAAAATTA CCAACAAGTA AAGACTGATT ACTCAGAGTT TAAAGTCTAT AAAATTAAAT 660 

AACCAAATCA ATATCTTGTG TATTTTTAAA AATTTTAGGA TTTTTAACAC AAGATATTGA 720 

TTTTTCTTTT TAGAGTGGTA TAATACTTTT TAGAAAGAAC ATTTTAGAAA AGAGCATGCA 780 

TATGATTGCA CTAGAAGAAA AAATTACAAT TTTGCCAACT CTCTTCGTCG AGAAACGAGA 840 

TGGGAGACGT GTTGTATTTG ATGTGGACAA GATTGACAAG GCTCTCCACA AGGCGGCTGA 900 

CAAGGTTATG GATGTGACAC CCCTGGTTGA AAAATGCCTC AATGATCTGA CTGAGCGAAT 960 

TATTACAGAA ATTCATAGTC GCTTTCCACA GGGAATTAAG ATTTACGAAA TTCAAAATAT 1020

CGTAGAACAT GAACTCCTTG AAGCCAAAGA ATATGCGCTG GCTGAGGAGT ATATTACTTA 1080 

TCGGACACAG AGGGATTTTG AGCGCTCAAA AGCGACGGAT ATCAACTTTA GTATTCATAA 1140 

ACTTCTCAAC AAAGACCAGA CAGTTGTCAA TGAAAACGCT AATAAAGACA GTGATGTCTT 1200

TAACACTCAG CGTGATTTGA CAGCAGGGAT TGTTGGGAAA TCAATCGGAC TGCAAATGCT 1260 

TCCTAAGCAC GTAGCCAATG CCCACCAAAA GGGGGATATC CACTATCACG ATTTGGACTA 1320 

CAGTCCCTAT ACCCCTATGA CCAACTGCTG TTTGATTGAT TTTAAGGGTA TGTTGGAAAA 1380

TGGTTTTAAG ATTGGAAATG CAGAGGTAGA GAGTCCCAAG TCTATCCAGA CTGCGACAGC 1440

ACAGATTTCT CAAATCATTG CCAACGTTGC TTCTAGCCAG TACGGTGGCT GTTCAGCTGA 1500 

CCGTATCGAT GAAATTTTGG CGCCTTATGC AGAGAAGAAT TATCAAAAAC ATCTCAAAGA 1560

TGCAGAAGAG TGGGTATTGC CTGAAAAACA GGAAGATTAC GCTTGGAAGA AAGCGCAAAA 1620 

GGACATCTAC GATGCCATGC AATCTCTTGA GTATGAAATC AATACTCTCT TCACTTCAAA 1680 

TGGACAAACA CCTTTTACTT CGTTAGGTTT TGGTCTGGGA ACCAGTCGTT TTGAACGAGA 1740 

AATTCAAAAA GCTATTTTAA ACATTCGCAT CAAGGGTCTT GGTTCAGAAC ACCGTACGGC 1800 

TATCTTTCCT AAACTTATCT TTACGCTTAA AAGAGGCCTC AACTTAGAGG AAGGAACTCC 1860 

CAACTATGAC ATCAAGCAGT TGGCTCTAGA GTGTGCAACC AAGCGGATGT ATCCAGACGT 1920 

CTTGTCTTAT GATAAGATTG TTGATTTGAC AGGTTCTTTC AAGGTGCCTA TGGGCTGCCG 1980 

TTCTTTCCTT CAAGGGTGGA AGGATGAAAA TGGTGTAGAA GTCAATTCAG GTCGCATGAA 2040

TCTGGGTGTT GTGACGGTTA ATCTGCCTCG TATTGCTCTT GAGTCTGAAG GTGATATGAA 2100 

TAAGTTCTGG GAAATCTTCA ACGAGCGAAT GAATATCGCA GAAGATGCTC TTGTTTACCG 2160 

TGTCGAACGC ACTAAAGAGG CGACACCAGC GAATGCTCCT ATTCTTTATC AGTACGGTGC 2220 

TTTTGGCCAT CGTCTAGGTA AAGAAGAAAG TGTTGACCAG CTCTTTAAGA ATCGTCGTGC 2280 

GACCGTTTCG CTGGGCTATA TCGGCTTGTA TGAAGTAGCG ACAGTTTTCT TTGGTAACAG 2340

CTGGGAAAGT AATCCAGATG CTAAGGAATT CACGCTAGAC ATCATTCACG ATATGAAACG 2400 

CCGTGTAGAA GAGTGGTCAG ACCAATATGG CTACCATTTC TCTATCTACT CAACACCATC 2460 

CGAAAGTCTG ACAGACCGTT TCTGCCGACT AGATATAGAC AAGTTTGCCT CTATTCCTGA 2520 

TATCACAGAC AAGGAATACT ACACCAACTC TTTCCACTAC GATGTTCGTA AAAATCCAAC 2580

ACCGTTTGAA AAATTGGACT TTGAGAAAGT CTATCCGGAA GCAGGTGCGT CAGGTGGTTT 2640

CATCCATTAT TGTGAGTATC CAGTCCTTCA GCAAAATCCA AAGGCCTTGG AAGCTGTCTG 2700

GGATTATGCT TATGACCGTG TAGGCTATCT AGGCACCAAT ACTCCGATTG ACCGTTGCTA 2760

CAAGTGTGAC TTTGAAGGGG ATTTTGAACC AACTGAGAGA GGGTTTGCTT GTCCAAACTG 2820

TGGCAATAGC GACCCTAAAA CAGTAGATGT GGTGAAACGA ACTTGTGGCT ACCTAGGTAA 2880 

TCCTCAAGCA AGACCGATGG TCAACGGGCG TCACAAGGAA ATCGCTGCGC GTGTCAAACA 2940 

TATGAATGGT TCAACGATTA AAATAGCTGG GCATCAAGTA ACAAATTAGA AAGAAATGAA 3000 

ATGGGAAAAT ATCAACTAGA CGATAAGGGG CGCGCACAAG TGACCCGTTA TCACGAGAAA 3060

CACTCTAAAG GTGGAGCTGG TAAGAAAGAA CGCTTGCTTA GCTTCAGAGA ACAATTTTTA 3120

AACAAGAACA AGAAAAAATA AAAGTGAGAG CCAGCTCTCG CTTTTCTCAT AGTGGGAGGT 3180

AAGGATGGAA TTACGCAGAC CAAGATTAGC GGATAAGAAA GCTGTTTTAG ATATGATGAC 3240 

AGAGTTTGAA AAATTTCAGT CGCCTCACGA CGGCGGTTTC TGGGATACAG AGAACTTTGT 3300

GTATGAAGAC TGGTTAGAAA GCAATCAGGA ACAGGAAATG GGGATTAATC TGCCTGAAGG 3360

ATGGGTTTCT GCAATTCAGT TAGTGGCTTT TTCTGAGAAA GGTCAAGCAG TTGGATTTCT 3420 

TAATCTCCGG TTGCGCCTCA GTAACTTTCT ACTAGAAGAA GGTGGCCACA TTGGCTACTC 3480 

CATTCGTCCA TCTGAAAGAG GCAAGGGTTA TGCAAAAGAG ACTCTCCGTC AGGGCTTGCA 3540

AGTTGCTAAG GAAAAGAACA TCAAGAAAGC TCTGGTGACC TGTAGTGTGA ATAATCCTGC 3600

TAGCAGAGCA GTCATTCTAG CAAATGGTGG AATATTTGAG GATGCTCGCA ATGGAGTCGA 3660

GCGTTATTGG ATAGAGGTAG CGAATGAATA ATCCAAAACC ACAAGAATGG AAAAGCGAGG 3720 

AACTTAGTCA AGGTCGTATC ATTGACTACA AGGCCTTTAA CTTTGTGGAC GGCGAAGGCG 3780

TGCGCAACTC TCTCTATGTA TCAGGCTGTA TGTTTCACTG CGAGGGATGT TATAATGTTG 3840 

CGACTTGGTC TTTTAATGCT GGCATTCCCT ATACAGCAGA ATTAGAAGAG CAGATTATGG 3900

CAGACCTTGC CCAACCCTAT GTTCAAGGCT TGACTTTGCT GGGAGGGGAG CCTTTTCTCA 3960

ATACTGGGAT TCTCTTGCCA CTTGTTAAGC GGATTCGGAA GGAATTGCCA GACAAGGACA 4020

TCTGGTCCTG GACCGGCTAC ACTTGGGAAG AAATGATGTT GGAAACTCCA GATAAACTGG 4080

AATTCTTGTC ACTGATTGAC ATTCTTGTCG ATGGAAGATA TGATCGAACT AAGAGAAATC 4140 

TTATGCTCCA GTTTCGAGGT TCATCTAACC AACGAATTAT CGATGTGCAA AAATCGCTCA 4200 

AAAGTGGGCA AGTAGTGATT TGGGACAAGC TCAATGACGG AAAAGAAAGC TATGAACAGG 4260

TGAAGAGAGA ATGAAGAAAA AGGACTTAGT AGACCAACTA GTCTCAGAGA TCGAGACGGG 4320 

GAAAGTCAGG ACACTGGGAA TATACGGTCA TGGAGCTTCA GGTAAATCAA CCTTTGCACA 4380 

GGAATTGTAC CAAGCTTTAG ATTCTACTAC AGTAAATTTG CTAGAGACAG ATCCTTATAT 4440

CACCTCAGGA CGCCATCTGG TAGTACCCAA GGACGCGCCG AATCAAAAGG TGACAGCCAG 4500 

TCTGCCAGTG GCGCATGAAC TGGAGAGTTT GCAGAGAGAT ATCCTTGCTT GCAGGCGGGT 4560 

ATGGATGTCT TGACAATTGA AGAACCTTGG AAGGCTAGTG AGGTCTTGTC TGGAGCCAAA 4620 

CCAATTTTGA TTGTCGAAGG GATGTCTGTT GGCTTTCTAC CCAAGGAACT CTTTGAAAAA 4680

ACCATCTGTT TCTACACGGA TGAGGAGACC GAATTAAAGC GACGCCTTGC TAGAGATACG 4740

ACTGTGAGAA ATCGCGATGC GG 4762
(2) INFORMATION FOR SEQ ID NO: 92: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3832 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

GATGCAGGTT TCGACCCACA TATTCCAGAA AATTACTTTA AAGATGATGA TGTTAATCAG 60 

GTACCTTGTC TTTGTTGGTC TTCATCTGCA GCCCTCTTTT TCAGTAATTG GGTAGACCAT 120 

GCGGTCTATC AGGAGACGCC TTTTGATTGG AGAAAGATAG AAGATGATGC ATCTGCATAT 180 

GGGTATTTAT AAGAGGAATT ATGACATATT TAGACGCTTT TAAATCAGGT ACCTTGGTTT 240

TACCGAGTGC CCTGCTCTTG CATTTTAAGG AACTCTTTCC TTCTAGCGAC GATTTTCTGG 300

TTTGGCAATT TTTCTATTTG CAAAATACGA CAGGCTTAGA AGAAATGTCG CCAAGCCAGA 360

TTGCTGAAAG GATTGGCAAG GAAATTTCGG ATGTCAACCA GTCCATTTCT AATCTGACGG 420

AAAGGGGACT GCTCCAGTAT CGTACTATCG AATTAAATGG CGAAATTGAA TTGCTCTTTG 480

ATGCTAGTTT GGCCTTGGAA CGTTTGGATG ACCTGTTTGG AGCAGTTCAT TCAAGTTCAG 540 

ACCAGCTAAC ACCTCAAAAC CAGCTCAAGG ATTTGGTGGA AACCTTCCAG CAGGAGTTGG 600 

GACGATTGTT GACGCCTTTT GAGATTGAGG ATTTGACCAA GACACTAAAG GAAGATGGAA 660 

CCAGTGCTGA CTTGATTAAG GAGGCTCTTC GTGAAGCTGT TTTGAATGGA AAACCAAACT 720

GGAAGTACAT TCAGGCGATT TTGAGAAACT GGCGCCATGA AGGAATCAAG AGTGTGGCTC 780

AAATTGAGGC CAAGAGAGCA GAAAGAGAAG CAAGCAATCC TCAGTTGACA CAGGTATCTG 840

CAGATTTCAT AAATGCCATG GATCTCTGGA AGGATTAATC CATGCAAGTA GGCTTGAAAT 900 

CCGAGTAAGA TTTGCAAGCT GTGTATAATT GTGATAGAAT AAATAGAAAA TAAATTGAAA 960 

AAAGAGGTAT GTGAAATGTC ACGTAAACCA TTTATCGCTG GTAACTGGAA AATGAACAAA 1020

AATCCAGAAG AAGCTAAAGC ATTCGTTGAA GCAGTTGCAT CAAAACTTCC TTCATCAGAT 1080 

CTTGTTGAAG CAGGTATCGC TGCTCCAGCT CTTGATTTGA CAACTGTTCT TGCTGTTGCA 1140

AAAGGCTCAA ACCTTAAAGT TGCTGCTCAA AACTGCTACT TTGAAAATGC AGGTGCTTTC 1200 

ACTGGTGAAA CTAGCCCACA AGTTTTGAAA GAAATCGGTA CTGACTACGT TGTTATCGGT 1260

CACTCAGAAC GCCGTGACTA CTTCCATGAA ACTGATGAAG ATATCAACAA AAAAGCAAAA 1320

GCAATCTTTG CGAACGGTAT GCTTCCAATC ATCTGTTGTG GTGAATCACT TGAAACTTAC 1380 

GAAGCTGGTA AAGCTGCTGA ATTCGTAGGT GCTCAAGTAT CTGCTGCATT GGCTGGATTG 1440 

ACTGCTGAAC AAGTTGCTGC CTCAGTTATC GCTTATGAGC CAATCTGGGC TATCGGTACT 1500

GGTAAATCAG CTTCACAAGA CGATGCACAA AAAATGTGTA AAGTTGTTCG TGACGTTGTA 1560

GCTGCTGACT TTGGTCAAGA AGTCGCAGAC AAAGTTCGTG TTCAATACGG TGGTTCTGTT 1620

AAACCTGAAA ATGTTGCTTC ATACATGGCT TGCCCAGACG TTGACGGTGC CCTTGTAGGT 1680 

GGTGCGTCAC TTGAAGCTGA AAGCTTCTTG GCTTTGCTTG ACTTTGTAAA ATAATCAGTA 1740 

AGTAGCAAAA GCTAGGTGGA ACAGCATTCA GATGTCTGTT ACATTTTTTA TAGGAGAGAA 1800 

AGATTGAAAA CAAAAATTGG ATTAGCAAGT ATCTGTTTAC TAGGCTTGGC AACTAGTCAT 1860

GTCGCTGCAA ATGAAACTGA AGTAGCAAAA ACTTCGCAGG ATACAACGAC AGCTTCAAGT 1920

AGTTCAGAGC AAAATCAGTC TTCTAATAAA ACGCAAACGA GCGCAGAAGT ACAGACTAAT 1980 

GCTGCTGCCC ACTGGGATGG GGATTATTAT GTAAAGGATG ATGGTTCTAA AGCTCAAAGT 2040

GAATGGATTT TTGACAACTA CTATAAGGCT TGGTTTTATA TTAATTCAGA TGGTCGTTAC 2100 

TCGCAGAATG AATGGCATGG AAATTACTAC CTGAAATCAG GTGGATATAT GGCCCAAAAC 2160 

GAGTGGATCT ATGACAGTAA TTACAAGAGT TGGTTTTATC TCAAGTCAGA TGGGGCTTAT 2220

GCTCATCAAG AATGGCAATT GATTGGAAAT AAGTGGTACT ACTTCAAGAA GTGGGGTTAC 2280

ATGGCTAAAA GCCAATGGCA AGGAAGTTAT TTCTTGAATG GTCAAGGAGC TATGATGCAA 2340

AATGAATGGC TCTATGATCC AGCCTATTCT GCTTATTTTT ATCTAAAATC CGATGGAACT 2400

TATGCTAACC AAGAGTGGCA AAAAGTGGGC GGCAAATGGT ACTATTTCAA GAAGTGGGGC 2460

TATATGGCTC GGAATGAGTG GCAAGGCAAC TACTATTTGA CTGGAAGTGG TGCCATGGCG 2520

ACTGACGAAG TGATTATGGA TGGTACTCGC TATATCTTTG CGGCCTCTGG TGAGCTCAAA 2580 

GAAAAAAAAG ATTTGAATGT CGGCTGGGTT CACAGAGATG GTAAGCGCTA TTTCTTTAAT 2640

AATAGAGAAG AACAAGTGGG AACCGAACAT GCTAAGAAAG TCATTGATAT TAGTGAGCAC 2700

AATGGTCGTA TCAATGATTG GAAAAAGGTT ATTGATGAGA ACGAAGTGGA TGGTGTCATT 2760

GTTCGTCTAG GTTATAGCGG TAAAGAAGAC AAGGAATTGG CGCATAACAT TAAGGAGTTA 2820

AACCGTCTGG GAATTCCTTA TGGTGTCTAT CTCTATACCT ATGCTGAAAA TGAGACCOAT 2880 

GCTGAGAGTG ACGCTAAACA GACCATTGAA CTTATAAAGA AATACAATAT GAACCTGTCT 2940

TACCCTATCT ATTATGATGT TGAGAATTGG GAATATGTAA ATAAGAGCAA GAGAGCTCCA 3000 

AGTGATACAG GCACTTGGGT TAAAATCATC AACAAGTACA TGGACACGAT GAAGCAGGCG 3060 

GGTTATCAAA ATGTGTATGT CTATAGCTAT CGTAGTTTAT TACAGACGCG TTTAAAACAC 3120 

CCAGATATTT TAAAACATGT AAACTGGGTA GCGGCCTATA CGAATGCTTT AGAATGGGAA 3180 

AACCCTCATT ATTCAGGAAA AAAAGGTTGG CAATATACCT CTTCTGAATA CATGAAAGGA 3240 

ATCCAAGGGC GCGTAGATGT CAGCGTTTGG TATTAAGCGA TGATTTGAAA GAGGGATGTG 3300

ATAGTAGCAC CCTCTTTTTC TTTGTTTTAT GATAGTTCAT CCTCGAGTAA ATTCAAGTTC 3360

TTGCTCGGAA ATGAAGCTTA TATAGTAGAT TGAATATAGA CAAATACCTT GTGATTGGTA 3420 

AAACATTTTA GAAATTCATT TACCTTTCCT AATCGACTTG GTTTCATCTT ATTTCAATCT 3480 

ATTATAGTAT TGGGGAATTT CTTCAAACCA CATCAGCTTG GTCAGTTCTA CCTGCGACCT 3540 

CAAAACTTGT GCTTTGGTCA AGCTGGGTTT AGTTTCCTAG TTTGCTGATG GATTTCCATT 3600

GACTATAAGC ATCCAACCCT CTTTTTGTCT TCTAAAGAAT TCTTAAATTA TCAGTCTATT 3660

GCAACTTTTC TCATATAAGT TCTTTGTCTT GCTATTGGTT TTCCTTAGTA GTATACTAAG 3720

GTAGTAATCA TTAAGAAGTG GTTACAAAAA ATAATGAATG AGGTAAAGAA AATGGTAGAA 3780

TTGAAAAAAG AAGCAGTAAA AGACGTAACA TCATTGACAA AAGCAGCGCC GG 3832
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10690 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 
TGAAAAAATC CTCATGAACC TGGCGCCAAT AGACAAGTGT CTTGTTTCCC TCACCTTCCT 60

TATAGGCATG GTCAGCTGAC ACTCGATTGA AGGGTTTAAC AGAAACCTTT G

CAATGCAGAC AGCCTGATTT TGACTATCTA AAATGACATC GAAGGTCCCT ;

GTGGTTCGTC TTCTAGCACA TAGAGGTCAT AGGCTGATGC TGTTGCTGTC TTTTCTCCTT 240

TAAACACCAA ATCCGCTAAA AGGTCTGGTT CAACTCCAAA AGCCCAGGCA TCGATTTCAT 300

CTCCGATCAA AGGATTGATT TGCTTGTATT TATTCCACAT TTCTTGCGGT ATCATGGGTG 360

CTCCTTTGTA ATTTTTTACT TTCTTCTTTT ATGTGTTTAA GATGATCTGG ATGGTCAATC 420

TCTAAATCAA AAATCTCTGG AATAGAACTG TAGTGGATAA TGCACTTGAT ACCCAACTGA 480

TTCATTTTTT GTATGAAAGA AGTATTCAGA TAGCCTGCTA CAGCAAAATC AATCTTGTTC 540

TTTCTTGCTT TATCCTGCAT ATCTCTTAGC ATATCTAACA TTATTGGACT TTCCATATCA 600

TGCCATTGAC TGTTTCTCAT AGTCGCAAAA ACAAAGGAAG TCAAATCATT CATTCCAACT 660

ACAATCTTTG AAATGCCCGT TTCCAGTATA CTAGATAAGT CAAAATACGC TGACGGTAAT 720

TCAATCATCG TTCCGACTTT CCCAGTAAAA CCCTGCTGAC GCAATACTGT AATAGCTTGT 780

TTTAATTGGT CGGCATCATT GACAAAAGGA AAGATAACAG ATAGATTGGG GTTGGTTTGA 840

TAAACTTCTG TAACGACATG TGCTTCAGCC TGAAATTCAT CCAAACACGC CAGTAAACGC 900

CTAGTTCCTC TATAGCCAAA CAAGGGATGC CCTTCGTCAA AAAACTCTTT AGTCCCCACT 960 

AAACAATTGG CTTCTGTATT CGTTAATTCA GTAAAACGAT ACCAAACTTC CTTACCTAAG 1020 

TAAAAGGAGC AAATAGTATC AAGATAATCT TTCACAAATT CCTGACAACT TTGTAATAGT 1080 

ATATTTTGAT TGAGCTCTCT CAATAAGTAT TCCCCACGAA TCATGCCGAC GTGGTGAAAT 1140 

AGTTGAGGAT AAATTTTTTC AAGAATTTTT TCGCCACTAA GGGCAAGTTG ATTTCTCATC 1200 

ATTCACCTTC CAATTCATGT AAGAAGTCTT GTCCAGTTCT GGAAATCCTA ATAATTCAGA 1260

CTTAACCTTC AAGACTAATG GCGATGCATT TTCTTCTGTA ATCTCTTGAA TATCCATCCA 1320

AATATATCCA AGTGAATCAT TCGCACCATC AGACACAGCT TCCGAAATCG TAACTTGAGG 1380

TGCACTCTCA TTCATTTCAA CATCATACAA GGCTATGACA TGGTGAACCA TAAAATTTTT 1440 

TAACTCTTCC CTGACGAAAA CATCGTAGAT TCGAGGATTA GAGTAGCTTC TAACAGTAAA 1500

TCCCGTCTCT TCCATAACTT CTCTAGTCAG CGTTTCCGTC AGTCCTTCAC CAAGTTGCTG 1560

ACTGCCTCCA GGTAGATCAT ACCGATGTTG ATAAGGGCCT CTCGTTTTTT CAATGCAAAG 1620 

TAACTTTCCA TTTTCAAAGC AAACACAGTA GACCCCAAAG TGATTTTTGA TTTCCATCCA 1680

ACTCCTCCTA CTTCAAAGAC CAGCCACCAT CTATTGTCAA GATTTGTCCT TGCATGGCGC 1740

TCGCTTTTCC ACTTGCTAAA AAAAGACTAA GCTCTGCTAT TTCCTCTGGC TCAATCCAGC 1800 

GCTTGATTGG GGTTTCACTA GCCACCCAGT CAGCCAAACC ACCTGGTTCA AAATCCGCAG 1860

CGGTCATAGC TGTCTTGACT GCTCCTGGAG CGATACCAAA GACCTGAATC CCAGCTTCAG 1920 

CATAGTCTAG AGCCAACTGC TTGGTGAAGC CAGCCAAGGC ATGCTTGGAT GAAGTATAGG 1980 

CGTGACCACC TCCACCTGCT AGGCTAGAAG CAATGGAACA CATATTGATG ATGATTCCCT 2040

TTTTATTTTC CAGCATTTGT GTCAAATAAT ACCGAGTCAA CTCTACTGGA ATAATGTAGT 2100 

TGATTTCAAA AATCTCTTGA ATGTCCTGCG CCGTTTGTTC CAACAGTGGT TTGTAATCAT 2160 

CCAAAACTCC AGCAGTATTA CACAAAACAT CCACCTGAGG GCACCAGTCA AAAATAGGTT 2220

CCAAGTCCAA GGTCAAATCT CTCTGTAAAA AGCGAAAATC ACCCTCTAAG AGTGGCTTTT 2280

CACCTTGGTC AACTCCATAA ACTTGATAGC CCTTCTCTAA AAAGAGGCGA GCTTGAGCCA 2340 

ATCCGATCCC TGAACTCACT CCTGTAATGA GTACACGTTT AGTCATGCAC TTCTACCCAA 2400 

TCCGTTGCCA AAACATCACA AACTGTCGGG CTCCACATGG AAAAACCTTC TCCTTCGCCA 2460

GAAACGTTGA TTAGGAAATA AGGTGTCATT TCAAGTGCAA GCCCATTTTG CTCGATGGTA 2520 

TCAAAGAGTT GGACATAGTT TTCCGCACCT CCCCAACCAG TTCGTACATA TTTTCTCTTA 2580

GCCTTTAACC CAGGCAGGAT CTCTTCAAAT GTCATGTTTT TCTCCTTTAA TTCTACATTC 2640

TTCATTTAAT TATAGCAAAA AACCGCTTTA TACGGCTTTT TGAATGTGAG TTATTCAAAC 2700

CTGCTACTAC TTACGGCAAA TTATTCCCTG CAGCAAGATA AATTTCATAC CATTCTTTTC 2760

TTGTTAAGCT AAAGTTTGCC GCTCGGCTAA CTTCTCTCAA GTGCTTAGGA TTTGTTGTAC 2820

CTACGACTGC CTGCATTTTT GCTGGATAAC GCAATATCCA AGAAATGGCA ATAGTTGAAG 2880

AGGTTACTCC ATATTTAATA GCTAAACGAT CAAGTACTTG ATTTAAAGCT TGAAATTTCT 2940

CATTTCCAAC AAAATTCCCT TTAAAATACC CGAATTGTAA GACAGACCAT GCTTGAATGA 3000

CCACATCGTG TAATTGGCAA TATTCAAAAA TGCTGCCATC TCGCATAGCT GCTTGACTAT 3060 

CTTCCATATT AACATGAAAA GCTGATTCAA ATCCTGGAGT AAAAGCCGCA CTCAATTGTA 3120

GCTGATTAAC AGCTAACGGC TGCTTGACAT CTTTTTTAAG CAACTCCATC ATCATAGGAT 3180

TTTGATTAGA AACTCCAAAA TCTCGAACTT TACCTTGTTT ATAAAGGAGA TTAAAGGCTT 3240 

CTGCTACTTG GTCAGATTCC ATCAAAGCAT CTGGTCGATG AAGGAGCAAG CTATCTAGAT 3300

GATCAATCTT CAATCTTTGC AAAATACCGT CTACTGATTT TATAATATAG TCCTTAGAAA 3360

AATCAAAATA GGTAAATTCT TCAATGCGAA TGCCACATTT GGACTGAATC CACATCTTTT 3420

CTCTTAAATC TGGACGATTT TTTAGGACAA GACCTAACAG TTCTTCACAA CGACCACGAC 3480

CATAAATATC AGCCAAGTCG AAGGCATTGA TTCCAACAGA AAGTGCTGTT TCTACAAGCT 3540 

CTTCAACTTC TTTTACAGAT TTATCTTTTA TTCTCATCAT TCCGAGAACA ATTTCTGATA 3600 

ATTCTTTGTC ATCTTGACCA AGAGTTATGT ATCTCATCAA ATTTTTCTCC TTTAATTTCT 3660

AACATTCTTC CCTTCATTAT AACAAAAAAC CGCTTTGCAA CGACTTTTTG ACTATACTTC 3720

ACTCCATTTT ATCTTCTTAA ACCCACGGAA CAAGACAAAG ATTCCAATAA AGAGGACAGC 3780 

TAAAGGAATA ACTTTTGTAA GGAAAACATT TGAAATTCCC ATCCACTCAT AATAACGGAG 3840

CAGAGAACCC ACCACAAGAT GGGCAATAAT CATACTGACA AATGGACGAA AGACCGCTTC 3900 

TTTCCAATTC CAAATACCGA TAACTAGCGA AATCGTAAAG ACAGACAAAC TATCCCAGGG 3960

AGCCGGAATA TAAAAGGCTC CTTCTTGTAT GAAGCTTGCC ATTCCTACAT ATCCTAAAAC 4020 

AACTAGAAGA ACTATAGTCC CAACAACAAT GTAAGTGCCA ATTTTCATTT TAGGAGAATC 4080 

TTGGACTAAA CTTCTTCGTA AAATTGTGGC CACAAGTCCA AATCCAATCA GAAAAATAAG 4140 

AAGTTGCCCT AAAAATGTGA GCAAATTGAC TGTTAAGAGA GGACCTTTAG AAAAATCACT 4200 

TAGTAGTTGA TAATAACGTA ATACCGCCAG GACAAGAATT GGCGTCAAAA GGGACTCTTT 4260

GATAGAACTG CGAGGTGCTC CCTTGAGAAT CTCTTTCATT ATTTTTTTAG GATTCTTACC 4320 

TAGATAATCC TCTGCACTCA TGCCATCTCG TTCTGCTTCT GAGAAATCTA GCATCATCAA 4380

ATAGATCTGC TCTCTGAGAT AGTCTTCATC ATAGAGAAAT CCAGCAAGAT TAAAACTTTC 4440

CCACAACTCC TCAAAATACT TTTGATTCTC CTCAGAAAAC TCATGTAGCA AAGCGCTTGT 4500 

TTCTTCGTAA TACTTCATTT TCTTCATGGT TTAACCCCCA TTCTTAATCC CTTCTACTTT 4560 

TTGACTCAAA TCGTCCCATT GTTGCCAAAA GACTGAGACA CGCTCTTCTC CTTCTTTCAT 4620

TAATGAAAAA TACTTCCGAT CTGGACCATC TGGCGACGGG CGCATGTCGC CTCTTATCCA 4680 

TTGATTTTTT TCTAACTTTT GCAACAAAGG ATAAATAGTT CCTGGAACGA TAGTATCAAA 4740 

TCCAGCCTCT CGCAAAGTCT GAACCAACTC ATAACCATAC CGCTCTTTTT GACCAATCAT 4800 

ATCCAAGACA CAACCTTCAA GAACACCTTT TAATAGCTGA GTTTCTTTCA TCACTTCTCC 4860

CTTCTAATCT ATTTTGTAAT ACCTACTAGT GACTTCACCT ATAGTATATC ACTTCTACAC 4920

TAGTTTGTAA AGCATAATAG TTAATACTCT TCGAAAATCT CTTCAAACCA CGTCAGCGTC 4980 

GCCCTACCGT ATGTATGGTT ACTGACTTCG TCAGTTTCAT CTACAACCTC AAAAACATGT 5040 

TTTGAGCTGA CTTCGTCAGT TTCATCTACA ACCTCAAAAC AGTGTTTTGA GCTGACTTCG 5100 

TCAGTTTCAT CTACAACCTC AAAACAGTGT TTTGAGCTGA CTTCGTCAGT TTCATCTACA 5160 

ACCTCAAAAA CATGTTTTGA GCTGACTTCG TCAGTTTCGT CTACAACCTC AAAACAGTGT 5220

TTTGAGCAAC CTGCGGCTAG CTTCCTAGTT TGCTCTTTGA TTTTCATTGA GTATAAATAA 5280

AAAAACAGAA CTAGCCTGAA CTAGTCCTGT CTACTTTTAC CCAATCACAC TTCCATTTGG 5340 

TACAGCTGGA TCAACTGTGA GAAGGGTTAA TTTGCCATCA TGTTCAGCTG AGAGAATCAT 5400 

ACCCTGGCTG ACATATTTTT TCATCATTTT ACGTGGTTTG AGGTTAGCAA CGATTTGAAC 5460 

TTTCTTGCCG ACCAATTCTT GTTCATTTGG ATAGTATTTT GCAATTCCTG AAAGAATCTG 5520 

ACGATCTTCT CCATCACCAG CATCCAAGCG GAATTGAAGC AACTTATCTG AACCTTCTAC 5580 

TTTAGACACT TCTTTGACTT CTGCGACACG GATTTCAACC TTGTCAAAGT CTTCAAACTT 5640

GATTTCATCC TTGTTTAGTT TGAGCTCAAC TTCGTCCGGA TTCCATTCTT TTTCGACTGC 5700 

TGGTTTATTG CCTTCCATTT GTTCCTTGAT ATAGGCGATT TCTTCTTCCA TATTTAGACG 5760 

TGGAAAGATA GGTGTTCCTT TGGCAACTAC AGTCACATCT GCTGGGAAGT CAGCCAAACT 5820

CAAGTTTTCA AGACTAGAAA CTTCTTCCAA ACCAAGTTGA GTCAAAACTG CACGACTAGT 5880 

TTCCATCATA AATGGTTCAA TCAAGTGAGC AACTACACGA ATGCTGGCTG CCAAGTGGCT 5940 

CATGACACTT GCCAATTGGT CACGAAGAGC TTCATCCTTG GCCAAGACCC ATGGTGCGGT 6000 

CTCATCGATG TATTTATTGG TACGAGAGAT CAGAGTCCAG ACTGCTTCAA GCGCACGTGG 6060 

ATAGTCAACT GCTTCCATGT GTGTATGGAA GTCTGCGATT GATTGTwCTG CAACCTCAGC 6120 

AAGAACATGA TCATATTCAG TCACACCTTC TACATAGGCA GGGATTTGTC CATCAAAGTA 6180

CTTATTAATC ATGGAAACCG TACGGTTAAG GAGGTTCCCA AGGTCATTAG CCAATTCATA 6240

GTTGATACGG CCGACATAGT CTTCAGGAGT AAAGGTTCCG TCTGAACCAA CTGGAAGGTT 6300

ACGCATGAGG TAGTAACGAA GTGGATCTAG TCCATAACGC TCTACCAACA TTTCAGGGTA 6360

AACGACATTC CCTTTTGACT TAGACATTTT TCCGTCTTTC ATGACAAACC AACCATGGGC 6420

AATCAAACGA TCAGGTAATT TAACATCCAA CATCATAAGA AGGATTGGCC AGTAGATAGA 6480

GTGGAAGCGA AGGATATCTT TTCCTACCAT ATGGAAGACT GTTCCATTCC AGAACTTGTC 6540

AAAGTTACCA TGTTCGTCTT GAGCGTAGCC AAGAGCTGTC GCATAGTTAA GAAGGGCATC 6600

AATCCAAACG TAGACAACGT GTTTTGGATT TGATGGGACA GGCACTCCCC ATGTAAAGGT 6660

TGTACGAGAT ACCGCCAAAT CTTCCAAGCC TGGCTCGATG AAGTTGCGTA GCATTTCATT 6720

AAGGCGACCA TCTGGCGTGA TAAATTCAGG ATGAGCTTTG AAAAATTCGA CCAAACGGTC 6780

TTGGTATTTG CTAAGGCGAA GGAAGTATGA TTCTTCAGAA AcCCATTCAA CCTCATGACC 6840

TGATGGAGCA ATACCACCAG TCACATTTCC AGCTTCATCA CGGAAAACTT CTGCCAGCTG 6900

GCTTTCTGTA AAGAATTCTT CGTCTGATAC TGAATACCAA CCAGAGTATT CACCCAAGTA 6960

GATATCATCT TGAGCAAGTA AGCGTTCAAA GACTTGTGCG ACAACTTTTT CATGGTAGTC 7020

ATCAGTTGTA CGGATAAATT TATCGTATGA GATATCTAGT AATTGCCAGA GTTCTTTAAC 7080

TCCAACCGCC ATTCCATCAA CATAGGCTTG AGGTGTAATA CCAGCTTCTT CCGCTTTCTG 7140

CTGGATTTTC TGACCATGTT CATCAAGACC TGTCAGATAA AATACATCGT AGCCCATCAG 7200

GCGTTTGTAA CGTGCTAGGA CATCACATGC GATAGTTGTG TAGGCAGAAC CGATATGAAG 7260

TTTCCCAGAT GGATAGTAAA TCGGCGTTGT AATATAAAAA TTTTTTTCAG ACATAATTTT 7320

TCCTTTCCAG GCAAATGAAA CCTGTTTTTC TAACACTTCA TTATATCACA TTTTTAftTGA 7380

ATTTCAATAG GGAAATCCAT ACAAAAACAA GATAGACGAG TGTCCATCTT GTTGATCTCA 7440

TTCATAACGA AGGGCTTCAA TTGGATCAAG TTTCGATGCC TTGTTGGCTG GCAAGACTCC 7500

AAAAATCATA CCAACACTAG CCGAAACTGC AAGACTAAAT AGGGCGACTG GGATTGATAC 7560

TCCAACTTCT ATACCTTCTA TTAAACCTTG CAGTAACAAA CCTGCTAAGg CAGTTAAACC 7620

ACTTGCAATT GTCAAGCCAA TTAAGCCACC TAACAAGGTC AAAATCATGG ATTCAATCAA 7680

AAACTGAATT AAAATATTGG CACGTGTTGC ACCCAAAGCC TTACGAAGAC CAATCTCACG 7740

AGTGCGCTCT GTCACCGAAA CCAGCATGAT GTTCATGACA CCAGTTCCTC CAACAAAGAG 7800

AGAAATCCCT GCGATGGAAC TAATAATCGT CGTCATAAAA CTAAACGATT GTTGAATTTC 7860

TGCAAATACA ACGGACTCAT CTGCCACCTG GTATTCTCCC TGTTGTAAGC CTGCAAGCTC 7920

TGTCATTTTT CGTGCCAGTT CTGGACCCAG AGTTGGGGTT AAACTGGTAT CATTCACTCG 7980

AAAGACAATA TTAGCTATTT CATCTACATT AAAATTCGCA GCAAGGGAGA TATTGGTAGT 8040 

AATAGGCAAG CCACCAAACC CATATATTTT TGATCTTTTA GCCTCCGGAC TAGTATAAAC 8100 

CCCAATGACC CGGTAACTAA ATCCATTGAC TTCTACAACC TTGTTAATAG CCTCTTGAGG 8160 

AGATTCAAAT AAACTAATGG ACAATTCCTC ATCTAGCAAA ATGACACTTG CAAACTCTTT 8220 

GAAATCTTGC TCTCTCAGAC TACGACCTGC AATAATTTCA TTCTTAACAG CGTCCATGTA 8280 

AGTTCTGTTT CCACCTGTCA AATTAGCATT CTCAACCTTT TTATCTTGAT AGGTCAAGAT 8340 

GGCATTCGTT GAATTGGTTA CATAGTAACT ATCCACTCCC TTCAGTTTAG CTGCCTCTTG 8400 

GACCCAGGAT TCTTGCGGTT TTGGCGGTTC AACAGGAACT TCCTCTTCCT TTCCAGAAAC 8460 

CGTAAAAGCT GATTGTTTCT GAGTAAAAGA CCCGTCTTTA CTTTTTTTAG GAGAGAAAAA 8520 

GACGCTAATA TTTTTCTGAG ATTTAGTCAT ATCTTTATTG ACTTGACGAG ATAGGGAATC 8580

ACCCAAAGCC ATAATCACAA CAACTGATGA AACACCGATA ATAATCCCAA TCATAGTAAG 8640 

CAAAGAACGC ATCTTGTGAG CCATGATAGA TGAAAAGGCA AATTTCAGAT TCTGCATCTT 8700 

AGTTTTCCTC CTTTCCTAAC TGAGCACTGT CAGACGAAAT GACCCCATCC CGAATGACAA 8760 

TCTGACGTTT GGCATAGGCA GCAATCTCAG GCTCATGCGT TACCATGATA ATGGTTTTTC 8820 

CTTCTTTATT CAAATCAACC AATAATTGCA TAATTTGGTT ACCTGTTTTG GTATCCAAGG 8880

CTCCTGTCGG TTCATCCGCT AGGATAATAG AAGGATTGTT TACCAAGGCA CGCGCAATGG 8940

CTACACGTTG CTTTTGACCA CCAGATAATT CTGAAGGTAA ATGGTGACTA CGTTCTGTCA 9000 

ATTCAACCTT GTCTAAATAT TCCTCAGCCA ACTTGCGACG TTTTGAAGAC GAAACTCCTG 9060

CGTAAATCAA GGGCAATTCT ACATTTTGCA GAGCATTGAG CTTCGATAGA AGAAAGAACT 9120 

GCTGAAAGAC AAAACCGATT TGTTGGTTAC GGACCTTAGC TAGTTGTTTT TCACCAAGCC 9180 

CAGCCACTTC TTGACCTTCA AGATAATATT CTCCACTGGT TGGTGTATCC AACATGCCAA 9240 

TCGTATTCAT CAGAGTGGAC TTACCAGACC CAGATGGTCC CATGATGGCT ACAAATTCAC 9300 

CCTCATTCAC TTCTAGATTG ATATTTTTGA GAACCTGCAG TTCTTGGTCA CCATTACGGT 9360

AACTTCTGAA GATATTTTTT AGACTAATTA GTTGCTTCAT CAGCCTTCAC CTCTTTTCCT 9420 

TCTTCCAAGG AAGATGTTGG ATTACTGATG ACCTTAGCAC CGTTCGTTAA ACCAGAAGTG 9480

ATTTCTTGAT TTTCTGCGTC AGCATTTCCC AATGAAACCT CAACTTTTTT AGCCTTTTGT 9540 

TGTTCATCCA CAATCCAGAC ATAATTTTTA CTATCATCCA TTACTAGACT GCTAACAGGA 9600 

ACAAGAATAG CCTTAGTTTT GCTTTTAACC TCAATGTTGA CAGAAAAACC TTGTTTCAAA 9660 

TCACCAACCT CGCCTGTCAC ATCAATAGTA TAAGGGTATT TAGAACCTGT ATTATTCCCG 9720

GCTGCTGGAC TAGCTGCTTC ACCATTGTTT TTAGGATAGT CAGAAATATA GCTTAATTTC 9780

CCAGTCCATT TTTTATCAGG ATACACTTTA GAAGTAAAGC TTACTTCTTG ACCTACAGAA 9840 

AGGTTGGCTA GATTGTACTC AGACAATTCT CCCTTGACTT GTAAATTTTC ATTGCTGACA 9900

ATATGAACCA TAACTTGACT CGCCCCTGTT GGAGATTTAG AAACATTGCT ATTGACTTCG 9960

ACCACAGTTC CCTCTAGGGT ACTGAGAACA GTTGTTGCAT CCAATTGACT TTGAGCCTTG 10020 

CTTAATTGCG CCGCAGCATC TGCACGCGCA TCACGGGCAT CACCCAATTG AGCGTCAATA 10080 

GAAGCAACAG AATTTCCAGC CACTGGAGTT GGGCTTTGCA CCGTTGCATC TTCTCCTCCT 10140 

ACTGGCGCTG GTAACTGTGG AGCCGGAGCT GAAGCGGCTT CATTTCGTGC TTGATTGAGT 10200

TCATTGATAT GACGATCTGC CCTAGCTACT GCTCGACTAG CTGAATCATA GGCCGCCTGC 10260

GCTTCTGAAC TACTGTACTT GACTAAAGCC TGCCCTTCGC TGACCTTATC GCCCACAGAA 10320

ACAAGGATTT CATCTAAATC ACCCTTACTA GCATCAAAAT AAACATATTG TTCATTTTTT 10380

GCTGTTACTG TCCCTGACAA TAAAACAGAG GAGGCCACGC TTCCTTCCTT GGCAACAACA 10440

AGATGAGTAG GCTCATCTTT TAGAGCAGTC TGAGAAGGTT GTCTAAAGAG TAAAATCCCC 10500 

CCAGCACCCA ATACAACTAC ACTCGCAGCA CCGATTGCTG CATACAGTTG CCACTTTTTA 10560 

GCTTTACCAT TCTTTTTCTT CATAATGAAA CTCCTTTTCT TTTTTACAAT ACTTTGCTAT 10620

TATACCAAAT TTCCCTCCAG CAAACAATAC AGTTCAGGAT TAAACAATCG TTCGGAATTT 10680 

TGCTTTTCGG 10690 
(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8195 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

GAGAAAGCGC CCACGTTTCC CCGAAGGGAG AAAGGCGGAC AGGTATCCGG TAAGCGGCCA 60 

GGGTCGGAAC AGGAGAGCGC AACGAGGGAG CTTCCCAGGG GGAAACGCCT GGTATCTTTA 120 

TAGTCCTGTC GGGTTTCGCC ACCTCTGACT TGAGCGTCGA TTTTTGTGAT GCTCGTCAGG 180 

GGGGCGGAGC CTATGGAAAA ACGCCAGCAA CGCGGCCTTT TTACGGTTCC TGGCCTTTTG 240

CTGGCCTTTT GCTCACATGT TCTTTCCTGC GTTATCCCCT GATTCTGTGG ATAACCGTAT 300 

TACCGCCTTT GAGTGAGCTG ATACCGCTCG CCGCAGCCGA ACGACCGAGC GCAGCGAGTC 360

AGTGAGCGAG GAAGCGGAAG AGCGCCCAAT ACGCAAACCG CCTCTCCCCG CGCGTTGGCC 420

GATTCATTAA TGCAGCTGGC ACGACAGGTT TCCCGACTGG AAAGCGGGCA GTGAGCGCAA 480 

CGCAATTAAT GTGAGTTAGC TCACTCATTA GGCACCCCAG GCTTTACACT TTATGCTTCC 540 

GGCTCGTATG TTGTGTGGAA TTGTGAGCGG ATAACAATTT CACACAGGAA ACAGCTATGA 600 

CaTGATTACG AATTCGAGCT CGGTACCCGG AAAATCCAGA AAATGCTTGA AAAAAATCCT 660 

AGAAGATGGT ATAATACTAA ATTGTAAGGG TTATCACATA TAACTCAAAA AAAGAAAGAA 720

CAAAAGGAGA GTCAAACTAT GGCTTCTAAA GATTTCCACG TAGTGGCAGA AACAGGTATT 780 

CACGCACGTC CAGCAACATT GTTGGTACAA ACTGCTAGCA AATTTGCTTC AGATATCACT 840 

CTTGAGTACA AAGGTAAATC AGTTAACCTT AAATCAATTA TGGGTGTTAT GAGTCTTGGT 900

GTTGGCCAAG GTGCTGACGT AACTATCTCA GCTGAAGGTG CAGATGCAGA TGACGCTATC 960 

GCTGCAATCT CAGAAACAAT GGAAAAAGAA GGATTGGCAT AAGGGAAATG ACAGAAATGC 1020

TTAAAGGAAT CGCAGCATCT GACGGTGTTG CAGTTGCAAA AGCATATCTA CTCGTTCAGC 1080 

CGGATTTGTC ATTTGAGACT ATTACAGTCG AAGATACAAA CGCAGAAGAA GCTCGCCTTG 1140

ATGCCGCTCT ACAGGCATCA CAAGACGAGC TTTCTGTTAT TCGCGAGAAA GCAGTAGGTA 1200

CGCTCGGTGA AGAAGCAGCT CAAGTTTTTG ATGCTCACTT AATGGTTCTT GCTGACCCAG 1260

AAATGATCAG CCAAATCAAG GAAACTATCC GTGCGAAGAA AGTGAATGCA GAAGCAGGTC 1320 

TGAAAGAAGT TACAGATATG TTTATCACTA TCTTTGAAGG CATGGAAGAC AACCCATACA 1380

TGCAAGAACG CGCAGcGGAT wTCCGCGACG TGACAAAACG TGTATTGGCA AACCTTCTTG 1440 

GTAAAAAATT GCCAAACCCA GCTTCTATCA ATGAAGAAGT GATTGTGATT GCGCATGACT 1500

TGACTCCTTC AGATACAGCT CAATTGGACA AAAACTTTGT AAAAGCTTTT GTAACCAACA 1560

TTGGTGGACG TACAAGCCAC TCAGCTATCA TGGCACGTAC ACTTGAAATT GCTGCTGTAT 1620

TAGGTACAAA TAACATCACT GAAATCGTTA AAGACGGTGA CATCCTTGCT GTTAACGGGA 1680

TCACTGGAGA AGTGATTATC AACCCAACAG ATGAACAAGC GGCAGAATTT AAAGCAGCTG 1740 

GTGAAGCCTA TGCGAAACAA AAAGCTGAAT GGGCACTTTT GAAAGATGCT CAAACAGTGA 1800 

CTGCTGACGG TAAACACTTC GAGTTGGCTG CTAATATCGG TACTCCAAAA GACGTTGAAG 1860

GTGTTAACAA CAACGGTGCA GAAGCTGTTG GACTTTACCG TACAGAGTTC TTGTACATGG 1920

ATTCTCAAGA CTTCCCAACT GAAGATGAGC AGTATGAAGC ATACAAGGCT GTTCTTGAAG 1980 

GAATGAACGG TAAACCTGTT GTCGTTCGTA CAATGGATAT CGGTGGAGAT AAGGAACTTC 2040

CTTACTTCGA TATGCCTCAC GAAATGAACC CATTCCTTGG ATTCCGTGCT CTTCGTATCT 2100 

CTATCTCTGA GACTGGAGAT GCTATGTTCC GCACACAAAT CCGTGCTCTT CTTCGTGCGT 2160

CTGTTCACGG TCAATTGCGT ATCATGTTCC CAATGGTTGC GCTCTTGAAA GAATTCCGTG 2220

CAGCGAAAGC AGTCTTTGAT GAAGAAAAAG CAAACCTTCT TGCTGAAGGT GTTGCAGTTG 2280 

CGGATAACAT CCAAGTTGGT ATCATGATCG AGATTCCTGC AGCGGCTATG CTTGCAGACC 2340

AATTTGCTAA AGAAGTTGAC TTCTTCTCAA TTGGTACAAA CGACTTGATC CAATATACAA 2400 

TGGCAGCAGA CCGTATGAAC GAACAAGTTT CATACCTTTA CCAACCATAC AACCCATCAA 2460 

TCCTACGCTT GATTAACAAT GTGATCAAAG CAGCTCACGC TGAAGGTAAA TGGGCTGGTA 2520

TGTGTGGTGA GATGGCTGGT GACCAACAAG CTGTTCCACT TCTTGTCGGA ATGGGCTTGG 2580 

ATGAGTTCTC TATGTCAGCA ACATCTGTAC TTCGTACACG CAGCTTGATG AAGAAACTCG 2640 

ACACAGCTAA GATGGAAGAG TACGCAAACC GTGCCCTTAC AGAATGCTCA ACAATGGAAG 2700 

AAGTTCTTGA ACTTCAAAAA GAATACGTTA ATTTTGATTA ATCGAAAAGT CCCTGCAACT 2760 

CAGTTACAGG GATTTTTTTG ATATTTTAAA AAGAATTTTC AAGAAAATCT TTCTTATAGA 2820

AAGTCCAACC TTGAAAAAGT AGTGGTCAGA ACAAAAAATA CTTAAATGGT TCATAAAATT 2880

CTTGACAAGT TGGATATTTA GGAGTAAACT ATTAACCAGT TAAGTAATAG AGAGGAGTTT 2940

CTGCAATTTA GAAATGAATT GCAACTAGAA ATATCAAATA GAAAGAGAGT TTCGATGAAA 3000

ATTAATAAGA AATACCTTGT TGGTTCTGCG GCACTTTGAT TTTAAGTGTT TGTTCTTACG 3060 

AGTTGGGACT GTATCAAGCT AGAACGGTTA AGGAAAATAA TCGTGTTTCC TATATAGATG 3120 

GAAAACAAGC GACGCAAAAA ACGGAGAATT TGACTCCTGA TGAGGTTAGC AAGCGTGAAG 3180 

GAATCAATGC TGAGCAAATC GTCATCAAGA TAACAGACCA AGGCTATGTC ACTTCACATG 3240 

GCGACCACTA TCATTATTAC AATGGTAAGG TTCCTTATGA CGCTATCATC AGTGAAGAAT 3300

TACTCATGAA AGATCCAAAC TATAAGCTAA AAGATGAGGA TATTGTTAAT GAGGTCAAGG 3360 

GTGGATATGT TATCAAGGTA GATGGAAAAT ACTATGTTTA CCTTAAGGAT GCTGCCCACG 3420

CGGATAACGT CCGTACAAAA GAGGAAATCA ATCGACAAAA ACAAGAGCAT AGTCAACATC 3480

GTGAAGGTGG AACTCCAAGA AACGATGGTG CTGTTGCCTT GGCACGTTCG CAAGGACGCT 3540

ATACTACAGA TGATGGTTAT ATCTTTAATG CTTCTGATAT CATAGAGGAT ACTGGTGATG 3600

CTTATATCGT TCCTCATGGA GATCATTACC ATTACATTCC TAAGAATGAG TTATCAGCTA 3660

GCGAGTTGGC TGCTGCAGAA GCCTTCCTAT CTGGTCGAGG AAATCTGTCA AATTCAAGAA 3720 

CCTATCGCCG ACAAAATAGC GATAACACTT CAAGAACAAA CTGGGTACCT TCTGTAAGCA 3780 

ATCCAGGAAC TACAAATACT AACACAAGCA ACAACAGCAA CACTAACAGT CAAGCAAGTC 3840

AAAGTAATGA CATTGATAGT CTCTTGAAAC AGCTCTACAA ACTGCCTTTG AGTCAACGAC 3900

ATGTAGAATC TGATGGCCTT GTCTTTGATC CAGCACAAAT CACAAGTCGA ACAGCTAGAG 3960

GTGTTGCAGT GCCACACGGA GATCATTACC ACTTCATCCC TTACTCTCAA ATGTCTGAAT 4020

TGGAAGAACG AATCGCTCGT ATTATTCCCC TTCGTTATCG TTCAAACCAT TGGGTACCAG 4080

ATTCAAGGCC AGAACAACCA AGTCCACAAC CGACTCCGGA ACCTAGTCCA GGCCCGCAAC 4140 

CTGCACCAAA TCTTAAAATA GACTCAAATT CTTCTTTGGT TAGTCAGCTG GTACGAAAAG 4200

TTGGGGAAGG ATATGTATTC GAAGAAAAGG GCATCTCTCG TTATGTCTTT GCGAAAGATT 4260

TACCATCTGA AACTGTTAAA AATCTTGAAA GCAAGTTATC AAAACAAGAG AGTGTTTCAC 4320 

ACACTTTAAC TGCTAAAAAA GAAAATGTTG CTCCTCGTGA CCAAGAATTT TATGATAAAG 4380

CATATAATCT GTTAACTGAG GCTCATAAAG CCTTGTTTGA AAATAAGGGT CGTAATTCTG 4440 

ATTTCCAAGC CTTAGACAAA TTATTAGAAC GCTTGAATGA TGAATCGACT AATAAAGAAA 4500 

AATTGGTAGA TGATTTATTG GCATTCCTAG CACCAATTAC CCATCCAGAG CGACTTGGCA 4560

AACCAAATTC TCAAATTGAG TATACTGAAG ACGAAGTTCG TATTGCTCAA TTAGCTGATA 4620

AGTATACAAC GTCAGATGGT TACATTTTTG ATGAACATGA TATAATCAGT GATGAAGGAG 4680 

ATGCATATGT AACGCCTCAT ATGGGCCATA GTCACTGGAT TGGAAAAGAT AGCCTTTCTG 4740

ATAAGGAAAA AGTTGCAGCT CAAGCCTATA CTAAAGAAAA AGGTATCCTA CCTCCATCTC 4800

CAGACGCAGA TGTTAAAGCA AATCCAACTG GAGATAGTGC AGCAGCTATT TACAATCGTG 4860

TGAAAGGGGA AAAACGAATT CCACTCGTTC GACTTCCATA TATGGTTGAG CATACAGTTG 4920 

AGGTTAAAAA CGGTAATTTG ATTATTCCTC ATAAGGATCA TTACCATAAT ATTAAATTTG 4980 

CTTGGTTTGA TGATCACACA TACAAAGCTC CAAATGGCTA TACCTTGGAA GATTTGTTTG 5040 

CGACGATTAA GTACTACGTA GAACACCCTG ACGAACGTCC ACATTCTAAT GATGGATGGG 5100 

GCAATGCCAG TGAGCATGTG TTAGGCAAGA AAGACCACAG TGAAGATCCA AATAAGAACT 5160

TCAAAGCGGA TGAAGAGCCA GTAGAGGAAA CACCTGCTGA GCCAGAAGTC CCTCAAGTAG 5220

AGACTGAAAA AGTAGAAGCC CAACTCAAAG AAGCAGAAGT TTTGCTTGCG AAAGTAACGG 5280

ATTCTAGTCT GAAAGCCAAT GCAACAGAAA CTCTAGCTGG TTTACGAAAT AATTTGACTC 5340

TTCAAATTAT GGATAACAAT AGTATCATGG CAGAAGCAGA AAAATTACTT GCGTTGTTAA 5400 

AAGGAAGTAA TCCTTCATCT GTAAGTAAGG AAAAAATAAA CTAATGAAAA ATGAAAGTCT 5460 

CGATAAAGAG GCTTTCATTT TTATTATGTA TATATGTAAA ATTCTTGACA AGCAATATTA 5520 

AAAAGAGTAA ACTATTAACT AGTTAATTAA CCGGTTTATT ACTTTATAGT GAATCAAATA 5580

TACTTAAGAA AAGAGGAAAG AATGAAAATT AATAAAAAAT ATCTAGCAGG TTCAGTGGCA 5640

GTCCTTGCCC TAAGTGTTTG TTCCTATGAA CTTGGTCGTC ACCAAGCTGG TCAGGTTAAG 5700

AAAGAGTCTA ATCGAGTTkC TTATATAGAT GGTGATCAGG CTGGTCAAAA GGCAGAAAAC 5760

TTGACACCAG ATGAAGTCAG TAAGAGGGAG GGGATCAACG CCGAACAAAT CGTCATCAAG 5820

ATTACGGATC AAGGTTATGT GACCTCTCAT GGAGACCATT ATCATTACTA TAATGGCAAG 5880 

GTCCCTTATG ATGCCATCAT CAGTGAAGAG CTCCTCATGA AAGATCCGAA TTATCAGTTG 5940

AAGGATTCAG ACATTGTCAA TGAAATCAAG GGTGGTTATG TTATCAAGGT AGATGGAAAA 6000

TACTATGTTT ACCTTAAGGA TGCAGCTCAT GCGGATAATA TTCGGACAAA AGAAGAGATT 6060

AAACGTCAGA AGCAGGAACA CAGTCATAAT CACGGGGGTG GTTCTAACGA TCAAGCAGTA 6120 

GTTGCAGCCA GAGCCCAAGG ACGCTATACA ACGGATGATG GTTATATCTT CAATGCATCT 6180 

GATATCATTG AGGACACGGG TGATGCTTAT ATCGTTCCTC ACGGCGACCA TTACCATTAC 6240

ATTCCTAAGA ATGAGTTATC AGCTAGCGAG TTAGCTGCTG CAGAAGCCTA TTGGAATGGG 6300

AAGCAGGGAT CTCGTCCTTC TTCAAGTTCT AGTTATAATG CAAATCCAGC TCAACCAAGA 6360

TTGTCAGAGA ACCACAATCT GACTGTCACT CCAACTTATC ATCAAAATCA AGGGGAAAAC 6420

ATTTCAAGCC TTTTACGTGA ATTGTATGCT AAACCCTTAT CAGAACGCCA TGTGGAATCT 6480

GATGGCCTTA TTTTCGACCC AGCGCAAATC ACAAGTCGAA CCGCCAGAGG TGTAGCTGTC 6540 

CCTCATGGTA ACCATTACCA CTTTATCCCT TATGAACAAA TGTCTGAATT GGAAAAACGA 6600

ATTGCTCGTA TTATTCCCCT TCGTTATCGT TCAAACCATT GGGTACCAGA TTCAAGACCA 6660 

GAACAACCAA GTCCACAATC GACTCCGGAA CCTACTCCAA GTCCGCAACC TGCACCAAAT 6720 

CCTCAACCAG CTCCAAGCAA TCCAATTGAT GAGAAATTGG TCAAAGAAGC TGTTCGAAAA 6780 

GTAGGCGATG GTTATGTCTT TGAGGAGAAT GGAGTTTCTC GTTATATCCC AGCCAAGGAT 6840 

CTTTCAGCAG AAACAGCAGC AGGCATTGAT AGCAAACTGG CCAAGCAGGA AAGTTTATCT 6900 

CATAAGCTAG GAGCTAAGAA AACTGACCTC CCATCTAGTG ATCGAGAATT TTACAATAAG 6960

GCTTATGACT TACTAGCAAG AATTCACCAA GATTTACTTG ATAATAAAGG TCGACAAGTT 7020

GATTTTGAGG CTTTGGATAA CCTGTTGGAA CGACTCAAGG ATGTCyCAAG TGATAAAGTC 7080

AAGTTAGTGG ATGATATTCT TGCCTTCTTA GCTCCGATTC GTCATCCAGA ACGTTTAGGA 7140 

AAACCAAATG CGCAAATTAC CTACACTGAT GATGAGATTC AAGTAGCCAA GTTGGCAGGC 7200 

AAGTACACAA CAGAAGACGG TTATATCTTT GATCCTCGTG ATATAACCAG TGATGAGGGG 7260

GATGCCTATG TAACTCCACA TATGACCCAT AGCCACTGGA TTAAAAAAGA TAGTTTGTCT 7320 

GAAGCTGAGA GAGCGGCAGC CCAGGCTTAT GCTAAAGAGA AAGGTTTGAC CCCTCCTTCG 7380 

ACAGACCATC AGGATTCAGG AAATACTGAG GCAAAAGGAG CAGAAGCTAT CTACAACCGC 7440

GAAGTCAAAA ACGGTAGTTT AATCATACCT CATTATGACC ATTACCATAA CATCAAATTT 7560

GAGTGGTTTG ACGAAGGCCT TTATGAGGCA CCTAAGGGGT ATACTCTTGA GGATCTTTTG 7620

GCGACTGTCA AGTACTATGT CGAACATCCA AACGAACGTC CGCATTCAGA TAATGGTTTT 7680 

GGTAACGCTA GCGACCATGT TCGTAAAAAT AAGGTAGACC AAGACAGTAA ACCTGATGAA 7740

GATAAGGAAC ATGATGAAGT AAGTGAGCCA ACTCACCCTG AATCTGATGA AAAAGAGAAT 7800

CACGCTGGTT TAAATCCTTC AGCAGATAAT CTTTATAAAC CAAGCACTGA TACGGAAGAG 7860 

ACAGAGGAAG AAGCTGAAGA TACCACAGAT GAGGCTGAAA TTCCTCAAGT AGAGAATTCT 7920 

GTTATTAACG CTAAGATAGC AGATGCGGAG GCCTTGCTAG AAAAAGTAAC AGATCCTAGT 7980 

ATTAGACAAA ATGCTATGGA GACATTGACT GGTCTAAAAA GTAGTCTTCT TCTCGGAACG 8040

AAAGATAATA ACACTATTTC AGCAGAAGTA GATAGTCTCT TGGCTTTGTT AAAAGAAAGT 8100 

CAACCGGCTC CTATACAGTA GTAAAATGAA TGGAGCATAT TTTATGGAGA AGTAACCTTT 8160

CGTGTTACTT CTCTTTTTTA GAAAAACGTA ACAGA 8195
(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

TTTACTAAAA GGAAAAAAGA ACTGATTTCT CAGTCCTTCA TTAATCTTAT TCCACACTAA 60 

ATAGGTATGG GTAAACAGGT TGTTGACCTT GGTGAATCTC GACTTCAACG TCTTCGAATT 120

CTTCTACGAT TTCTTGAGCG ATTTCATTGG CAAGTTCTTC GCTTCCGTCT TCACCTACAT 180 

AGAAGGTTAC GATTTCACTG TCTTCATCCA ACATATGTTT CAAGGTTTCA GTCAATGl TT 240

GGTGCATATC AGGGTTTGAC ACAAGAATTT TTCCATCCAC CATACCTAAA TTATCGTTTT 300

CATGGATTTC TAAGCCATCG ATCGTTGTAT CACGCACGGC TGTTGTGACG CTTCCGCTAA 360 

CGACATCGCT AAGAGCAGCT GTCATACGCT CTTGGTTTTC TTCAATGGAC TTGCTTGGAT 420

CAAAGGCAAG AAGACTTGTC ATACCTTGAG GAAGAGTGCG AGCCTCTACC ACTACCGCTG 480 

GTTGCTCCAA AACTTCTGCC GCAGATTGAG CTGCCATGAA GATGTTCTTG TTGTTTGGCA 540

AGAAGATGAT GTTACGGGCA TTAACCTGTT CAACAGCCTT GATAAAGTCT TCTGTTGAAG 600 

GGTTCATGGT TTGACCGCCT TCGATAACAT AATCCACGCC TTGAGAACAG AAGATATCTG 660

CTAGACCTTT ACCAGCCACC ACAGCAATCA AAGCATACTC TTTTTCTTCA GCCGACTTGA 720

TAACTTGAGT AGCTTCTTTC TCAACCTGTG CTTCGTGTTG GTTACGCATA TTGTCAACTT 780 

TTACCTTGAC CAAGCTACCA TATTTGAGAC CTTCTTGCAT AACAAGTCCT GGATCTTCTG 840 

TATGAACATG GACTTTGACA ATTTCATCAT CGTTAACAAC AAGGAGAGAA TCTCCAAGCT 900 

CATCCAAGTA GTTACGGAAT TCATCGTAGT CAAAATCTTT AGCATAGGTT GGACCTTGCT 960 

TAAGAGCTAC CATGATTTCA GTACAGTAAC CAAACGTGAT GTCCTCAGTC GCTACGTGAC 1020 

CAGCTACAGA CTTATGATGC TCTACATTGA TCATCTCACT CATGTTGGCA GGAGTCGCTA 1080 

CAAAGTCCTC AGATGCAATA TATTCGCCAG TAAGGGCTGA AAGGAAACCT TCGTAGATGA 1140 

AGACCAATCC TTGACCACCT GAGTCCACAA CGCCAACTTC TTTCAATACT GGAAGCATGT 1200 

CTGGTGTTTT AGCTAGAGCT GTTTTAGCAC CTTCCAAGGC TGCGCGCATG ACTTCAACAG 1260 

CGTCATCTGT TTGCTCAGCT TTTTTCTTAG CACCGATAGC AGCTCCACGA GAAACTGTTA 1320

AAATCGTTCC TTCAACAGGT TTCATCACTG CCTTATAGGC AACTTCCACA CCTGATTGGA 1380 

AGGCCAGAGC CAAGTCTTGA CCTGTTAACT CGTCTTTATC CTTGATAGCT TGGGAAAATC 1440 

CACGGAAAAG CTGAGACGTA ATCACTCCTG AGTTCCCACC CGCACCCATC AAAAGCCCTT 1500 

TGGCAAGAAT GCTCGCTACT TCTCCAACTG TAGAAGCTGG CTTGTCTGCA ACTTCTTTAG 1560

CACCATTTTC AATGGTCATT CCCATATTTG TCCCAGTATC TCCATCTGGA ACTGGAAAGA 1620 

CGTTTAATGA ATTGACATAT TCAGCTTGCT TATTCAAGCG AGTTGATGCA GCCTGCACCA 1680 

TTTCTTGAAA TAAGCTAGTA GTAATTTTTG ACACGGTTAT TCTCCTACAA CTTTGATATT 1740 

TTGAATGTAG ACATTTACAG TCTGAGCAGT AATTCCAAGC TGGTTTTCCA AGCTAAAGGC 1800 

AACACGCTCT TGAATGTTTT TTGACACTTC ACTAATCTTT GTTCCGTAGC TTAACACGGT 1860

ATATACATCA ACTGCAATAC TGCCATCTTC GGCTGCCTTT ACGACGACAC CTTTAGAATA 1920 

ATTTTCCTTA CCTAGCAGGG CTTGGAAATT ATCTTTGAGG GCATTTTTAC TAGCCATACC 1980 

GACCACACCA GAAATCTCAG TTGC 2004
(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11915 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96:

CCGGGTTGGG CTGTTCGCCC ATTAAAGCGG CACCACAGCT GGGTTCAGAA CGTCGTGAGA 60

CAGTTCGGTC CCTATCCGTC GCGGGCGTAG GAAATTTGAG AGGATCTGCT CCTAGTACGA 120 

GAGGACCAGA GTGGACTTAC CGCTGGTGTA CCAGTTGTCT TGCCAAAGGC ATCGCTGGGT 180 

AGCTATGTAG GGAAGGGATA AACGCTGAAA GCATCTAAGT GTGAAACCCA CCTCAAGATG 240 

AGATTTCCCA TGATTATATA TCAGTAAGAG CCCTGAGAGA TGATCAGGTA GATAGGTTAG 300 

AAGTGGAAGT GTGGCGACAC ATGTAGCGGA CTAATACTAA TAGCTCGAGG ACTTATCCAA 360

AGTAACTGAG AATATGAAAG CGAACGGTTT TCTTAAATTG AATAGATATT CAATTTTGAG 420 

TAGGTATTAC TCAGAGTTAA GTGACGATAG CCTAGGAGAT ACACCTGTAC CCATGCCGAA 480 

CACAGAAGTT AAGCCCTAGA ACGCCGGAAG TAGTTGGGGG TTGCCCCCTG TGAGATAGGG 540 

AAGTCGCTTA GCTCTAGGGA GTTTAGCTCA GCTGGGAGAG CATCTGCCTT ACAAGCAGAG 600 

GGTCAGCGGT TCGATCCCGT TAACTCCCAT TTTAGCGGGT GTAGTTTAGT GGTAAAACTA 660 

CAGCCTTCCA AGCTGTTGTC GCGAGTTCGA TTCTCGTCAC CCGCTTTGAA CTTTGTTCTT 720 

TGTACCAAGT TTTTGACTTG GGCGCGTAGC TCAGGTGGTT AGAGCGCACG CCTGATAAGC 780

GTGAGGTCGG TGGTTCGAGT CCACTCGTGC CCATAGTGTT TAGTCCATTA CTAGGGGATT 840 

GGAATATTAT CTGTTCACTA AGAGGACACG GGCTTGTTCC CGTATAAACT ATTTTGGAGG 900 

ATTACCCAAG TCCGGCTGAA GGGAACGGTC TTGAAAACCG TCAGGCGTGT AAAAGCGTGC 960 

GTGGGTTCGA ATCCCACATC CTCCTTTTAT ATTAACGCGG GATGGAGCAG CTCGGTAGCT 1020 

CGTCGGGCTC ATAACCCGAA GGTCGTAGGT TCAAATCCTG CTCCCGCAAT AAGGCTCGGT 1080 

AGCTCAGTTG GTAGAGCAAT GGATTGAAGC TCCATGTGTC GGCGGTTCGA TTCCGTCTCG 1140 

CGCCATTTAT ATATTTTGGA AGGGTAGCGA AGAGGCTAAA CGCGGCGGAC TGTAAATCCG 1200 

CTCCTTCGGG TTCGGGGGTT CGAATCCCTC CCCTTCCATT TTACGGGCAT AGTTTAAAGG 1260

TAGAACTAAG GTCTCCAAAA CCTTCAGTGT GGGTTCAATT CCTACTGCCC GTGTTAATAG 1320

AATTATGGCG GGTGTGGTGA AGTGGTTAAC ACACCAGATT GTGGCTCTGG CATGCGTGGG 1380

TTCGATCCCC ATCACTCGCC TATTTTATAT TGGGGTATAG CCAAGCGGTA AGGCAAGGGA 1440

CTTTGACTCC CTCATGCGTT GGTTCGAATC CAGCTACCCC AGTTACTATT TGCCGGCGTG 1500 

GCGGAATTGG CAGACGCGCT GGACTCAAAA TCCAGTGTCC GCAAGGACGT GCCGGTTCGA 1560

CCCCGGCCGC CGGTATAGTA TAGTGTTAGG AACGTTGTTA TTCTTCGTTC CTTTTTTATA 1620

TTATTTTTGG TATAATTATA GTTATTCAAA TTTTATTTAG ATTAAGAAAG TGTAGGGGAG 1680 

TATGTCTTGT TCTATCGATT TATTAAAACA TCGGTATTTG AAAAATATTA AAGAAAATCC 1740 

TGAATTGTTT GTCGGAATTG AGTTGGAGTA TCCTGTTGCA AGTTTAGAAG GGGATGCTAC 1800

AGATGTTGAA GTTATGAAGG ATCTATTTCA TTATTTAGTT TCTACTTTGG ATCTCACCGT 1860

AGCAAAGGTA GATGATTTTG GCAATCTGAT CCAGTTAGTA GATCCGATAA GTCAGGATGC 1920 

TATTTTATTT GAAGTTTCCT ATACAACGAT TGAGTTTGCA TTTGGTAAGG CTGAAACGAT 1980

TCAAGAGGTC GAAAATCGTT TCAATAATTA TATGAATGTA ATTCAGAGAA AGTTAGCTGA 2040

ATCAAATCAT GCTATTGTTG GCTGTGGTAT CCATCCCAAC TGGGATAAAA ATGAGAATTG 2100 

TCCAGTGGCT TATCCACGCT ATCAGATGTT GATGGATTAT TTGAATTTGA GTAGAAATAT 2160 

TATTAAATCA GATTTACATC ATTTCCCTGA ATATGGTACT TTTATCTGTG GGAGCCAGGT 2220

TCAGCTGGAT ATTTCAAAAA CCAACTACTT ACGGGTGATT AATGCTTTTA CTCAAATTGA 2280

AGCGGCTAAG GCTTATTTAT TTGCAAACTC TGAATTTTCG GGTGCGGATT GGGATACGAA 2340

AATTTCAAGG GATATTTTCT GGGAAGAATC TATGCATGGT ATCTATCCAG AGAATGTTGG 2400 

GGTCAATGCT AGACTCCTTA ATGATGAAAC TGATTTTTTT GACTATCTAA ATCATTCTGC 2460

GATTTTTACT GCGGAACGTG ATGGGCAGAC CTATTATTTT TATCCTATTC AGGCTGGGGA 2520

CTATTTGGCT ACGTCCGAAA TCCAAGCATT TGCTCTGAAT GGGGATGAGG TTATTATTTA 2580 

CCCCCAAGAG AAGGATTTTG AAACTCATCG TAGTTACCAG TACCAAGATT TAACGACTCG 2640

AGGAACAGTT GAGTTTCGTA GTGTGTGTAC ACAGCCACTT GATAGGACTT TTGCTTCTGC 2700 

AGCTTTTCAC TTGGGATTAT TGGTTAATTT AGACAAGTTA GAAGCTTACT TAGAAACAGC 2760

ACCTTTCTTT AAAGTATTTG GTTATGATTA CAAGTCTTTA AGGAGACAAT TTTCTAAGAA 2820

AAATCTTACA GATGAGGAAG AAACTACGAT TATTGAATTT TCCAAAGACT TACTCCTACT 2880

AGCTGAGGAG GGACTAGTGG TGAGAAATAA GGAAGAAATG ACCTATTTAC AGCCTTTGAG 2940

AGAAGAATTG AGCCTATAAT TTCTCTTATA AAGGGAGAAT TTTCTGAAAA ATCATGATAT 3000 

AATGGACGAG ACTATAGATA AAGGATAGAG AGTAATGACA TTAGTTTATC AATCAACGCG 3060 

TGATGCCAAC AATACAGTAA CTGCCAGCCA AGCAATTTTG CAAGGTTTGG CGACGGACGG 3120

CGGTTTGTTT ACACCGGATA CTTATCCAAA GGTAGATTTG AACTTTGACA AATTGAAAGA 3180 

TGCTTCTTAC CAGGAAGTTG CTAAGCTAGT TTTGTCAGCA TTTTTAGATG ACTTTACAGT 3240

TGAGGAGTTG GACTACTGTA TCAACAATGC CTACGATAGC AAATTTGATA CTCCAGCTAT 3300

TGCACCATTA GTGAAATTAG ATGGGCAATA CAATTTGGAA CTTTTCCATG GTTCAACGAT 3360

TGCCTTTAAG GATATGGCCT TGTCTATTTT GCCATACTTT ATGACGACTG CTGCTAAGAA 3420

ACATGGTTTG GAGAACAAGA TTGTTATCTT GACAGCGACA TCTGGTGACA CGGGGAAAGC 3480

TGCTATGGCG GGGTTTGCGA ATGTGCCTGG TACTGAGATT ATCGTCTTTT ATCCAAAGGA 3540

TGGTGTCAGC AAGATTCAAG AGTTACAAAT GACCACTCAG ACTGGCGACA ATACTCATGT 3600

TATTGCTATT GATGGTAACT TTGACGATGC GCAAACAAAT GTGAAGCACA TGTTTAACGA 3660

CGTGGCTCTT CGTGAAAAAT TGACTACCAA CAAGTTGCAA TTTTCATCAG CTAACTCTAT 3720

GAACATTGGT CGTCTGGTGC CACAAATTGT TTATTATGTT TATGCTTACG CTCAATTGGT 3780

TAAGACTGGT GAAATTGTAG CTGGTGAAAA GGTTAACTTC ACAGTACCAA CAGGAAACTT 3840

TGGAAATATC TTGGCTGCCT TTTATGCCAA ACAAATTGGT TTGCCAGTTG GTAAATTAAT 3900

CTGTGCTTCA AATGACAACA ATGTTTTGAC AGACTTCTTT AAAACACGTG TCTATGACAA 3960

AAAACGTGAG TTTAAGGTAA CAACCAGCCC ATCTATGGAT ATCTTGGTAT CTTCAAACTT 4020

GGAGCGCTTG ATTTTCCATC TTTTGGGAAA TAATGCTGAA AAGACAACTG AACTTATGAA 4080 

TGCCTTGAAC ACGCAAGGAC AATATAAGTT GACAGACTTT GATGCAGAGA TTTTGGACCT 4140 

CTTTGCAGCT GAATATGCGA CTGAGGAAGA AACGGCAGCA GAGATCAAGC GTGTTTGTGA 4200

GTTAGATTCT TATATCGAGG ACCCTCATAC AGCTGTTGCT TCAGCAGTTT ATAAAAAATA 4260

CCAATCGGCC ACTGGAGATG TAACTAAGAC AGTGATTGCT TCAACAGCTA GTCCATACAA 4320

GTTCCCAGTA GTTGCAGTAG AAGCTGTAAC TGGAAAAGCA GGTTTAACAG ACTTTGAAGC 4380

CTTGGCTCAA TTACATGAAA TCTCAGGCGT TGCAGTGCCA CCAGCACTTG ATGGGCTTGA 4440

AATAGCTCCA ATTCGTCACA AGACAACAGT GGCAGCTGCT GACATGCAAG CAGCGGTTGA 4500

GGCTTATTTA GGACTTTAAG ACAGAGGGAG CAAACTCGGT TGGGAAACCA ACTGAGTTTC 4560 

TTTTCATCAG GAGGAGAGAT TGTTTAAGAA AAATAAAGAC ATTCTTAATA TTGCATTGCC 4620 

AGCTATGGGT GAAAACTTTT TGCAGATGCT AATGGGAATG GTGGACAGTT ATTTGGTTGC 4680 

TCATTTAGGA TTGATAGCTA TTTCAGGGGT TTCAGTAGCT GGTAATATTA TCACCATTTA 4740 

TCAGGCGATT TTCATCGCTC TGGGAGCTGC TATTTCCAGT GTTATTTCAA AAAGCATAGG 4800 

GCAGAAAGAC CAGTCGAAGT TGGCCTATCA TGTGACTGAG GCGTTGAAGA TTACCTTACT 4860

ATTAAGTTTC CTTTTAGGAT TTTTGTCCAT CTTCGCTGGG AAAGAGATGA TAGGACTTVT 4920

GGGGACGGAG AGGGATGTAG CTGAGAGTGG TGGACTGTAT CTATCTTTGG TAGGCGGATC 4980 

GATTGTTCTC TTAGGTTTAA TGACTAGTCT AGGAGCCTTG ATTCGTGCAA CGCATAATCC 5040 

ACGTCTGCCT CTCTATGTTA GTTTTTTATC CAATGCCTTG AATATTCTTT TTTCAAGTCT 5100 

AGCTATTTTT GTTCTGGATA TGGGGATAGC TGGTGTTGCT TGGGGGACAA TTGTGTCTCG 5160 

TTTGGTTGGT CTTGTGATTT TGTGGTCACA ATTAAAACTG CCTTATGGGA AGCCAACTTT 5220 

TGGTTTAGAT AAGGAACTGT TGACCTTGGC TTTACCAGCA GCTGGAGAGC GACTTATGAT 5280 

GAGGGCTGGA GATGTAGTGA TCATTGCCTT GGTCGTTTCT TTTGGGACGG AGGCAGTTGC 5340

TGGGAATGCA ATCGGAGAAG TCTTGACCCA GTTTAACTAT ATGCCTGCCT TTGGCGTCGC 5400

TACGGCAACG GTCATGCTGT TGGCCCGAGC AGTTGGAGAG GATGATTGGA AAAGAGTTGC 5460 

TAGTTTGAGT AAACAAACCT TTTGGCTTTC TCTGTTCCTC ATGTTGCCCC TGTCCTTTAG 5520

TATATATGTC TTGGGTGTAC CATTAACTCA TCTCTATACG ACTGATTCTC TAGCGGTGGA 5580

GGCTAGTGTT CTAGTGACAC TGTTTTCACT ACTTGGGACC CCTATGACGA CAGGAACAGT 5640

CATCTATACG GCAGTCTGGC AGGGATTAGG AAATGCACGC CTCCCTTTTT ATGCGACAAG 5700

TATAGGAATG TGGTGTATCC GCATTGGGAC AGGATATCTG ATGGGGATTG TGCTTGGTTG 5760

GGGCTTGCCT GGTATTTGGG CAGGGTCTCT CTTGGATAAT GGTTTTCGCT GGTTATTTCT 5820

ACGCTATCGT TACCAGCGCT ATATGAGCTT GAAAGGATAG GAAATGCAAA AAACAGCTTT 5880 

TATTTGGGAT TTAGACGGGA CTTTATTGGA CTCTTACGAA GCGATTTTAT CAGGGATTGA 5940 

GGAGACTTTT GCTCAGTTTT CTATTCCTTA TGATAAGGAG AAGGTGAGAG AGTTTATCTT 6000 

CAAGTATTCG GTGCAAGATT TGCTTGTGCG GGTGGCAGAA GATAGAAATC TGGATGTTGA 6060

GGTGCTAAAT CAGGTGCGTG CCCAGAGTCT GGCTGAGAAG AATGCTCAGG TAGTTTTGAT 6120

GCCAGGTGCG CGTGAGGTGC TAGCTTGGGC AGACGAATCA GGAATTCAGC AGTTTATATA 6180 

TACTCATAAG GGGAACAACG CTTTTACCAT TCTCAAGGAC TTGGGGGTGG AATCCTATTT 6240

TACAGAGATT TTAACCAGTC AGAGTGGCTT TGTGCGGAAG CCAAGTCCAG AAGCGGCTAC 6300 

CTATCTGCTA GATAAGTATC AGTTGAATTC TGATAATACT TATTATATAG GGGATCGGAC 6360

TCTGGATGTG GAATTTGCCC AGAATAGTGG GATTCAAAGC ATCAACTTTT TAGAGTCTAC 6420

TTATGAAGGG AATCACAGGA TTCAAGCGTT AGCAGATATT TCCCGTATTT TTGAGACTAA 6480

GTGATAAAAA GATTGTGTCA GTTTTGTGAC AGAGACCTAA CAAACTATTT CAAGTAACCT 6540

AGTTTGTTAC AAGGAATAGA CAGTTCTGTT AAATAGGCCC GAGAGGGCTT TTTTTCTACA 6600

TTTTTTGTGT TATGATAGAC AGGTACTCAT TTGAAAGGAA TTTGAAAGAA TGAAGAAAAG 6660

AATGTTATTA GCGTCAACAG TAGCCTTGTC ATTTGCCCCA GTATTGGCAA CTCAAGCAGA 6720

AGAAGTTCTT TGGACTGCAC GTAGTGTTGA GCAAATCCAA AACGATTTGA CTAAAACGGA 6780 

CAACAAAACA AGTTATACCG TACAGTATGG TGATACTTTG AGCACCATTG CAGAAGCCTT 6840 

GGGTGTAGAT GTCACAGTGC TTGCGAATCT GAACAAAATC ACTAATATGG ACTTGATTTT 6900

CCCAGAAACT GTTTTGACAA CGACTGTCAA TGAAGCAGAA GAAGTAACAG AAGTTGAAAT 6960 

CCAAACACCT CAAGCAGACT CTAGTGAAGA AGTGACAACT GCGACAGCAG ATTTGACCAC 7020

TAATCAAGTG ACCGTTGATG ATCAAACTGT TCAGGTTGCA GACCTTTCTC AACCAATTGC 7080

CCCAGAGGAG CAAACGACCG AAACAACTCG CCCAGTTGAA GAAGCAACTC CTCAGGAAAC 7200

GACTCCAGCT GAGAAGCAGG AAACACAAGC AAGCCCTCAA GCTGCATCAG CAGTGGAAGT 7260

AACTACAACA AGTTCAGAAG CAAAAGAAGT AGCATCATCA AATGGAGCTA CAGCAGCAGT 7320

TTCTACTTAT CAACCAGAAG AGACGAAAAT AATTTCAACA ACTTACGAGG CTCCAGCTGC 7380

GCCCGATTAT GCTGGACTTG CAGTAGCAAA ATCTGAAAAT GCAGGTCTTC AACCACAAAC 7440 

AGCTGCCTTT AAAGAAGAAA TTGCTAACTT GTTTGGCATT ACATCCTTTA GTGGTTATCG 7500 

TCCAGGAGAC AGTGGAGATC ACGGAAAAGG TTTGGCTATC GACTTTATGG TACCAGAACG 7560 

TTCAGAATTA GGGGATAAGA TTGCGGAATA TGCTATTCAA AATATGGCCA GCCGTGGCAT 7620

TAGTTACATC ATCTGGAAAC AACGTTTCTA TGCTCCATTC GATAGCAAAT ATGGGCCAGC 7680

TAACACTTGG AACCCAATGC CAGACCGTGG TAGTGTGACA GAAAATCACT ATGATCACGT 7740

TCACGTTTCA ATGAATGGAT AAACCCGACT TGATAACATC ATTTTGACGA ATGAGATCTA 7800

GCTTTCGTGA TGGAAAGCGA TTCTCGTTCG TTTTTTCTTT GTCATACTCT TCGAAAATCT 7860

CTTCAAACCA CGTCAGTTTT ATCTGAAACT TCAAAGCTGT GCTTTGAGCA ACCTGCGACT 7920

AGCTTCCTAG TTTGCTTTTT GATTTTCATT GAGTATCAAT TTGAATGGAA AATGGAAAGT 7980

TATCATCTTG TAATGAGTTA AGCAACATTC TTGCAATCTA TTTTACTTTA TATCACAATT 8040

AATTAGTCAA ATATTGATAA ATCAATAAAA AGAGAGGGGA AGAAATGCTA GAGATTCAAG 8100 

ATTTACTGTA TCAACTCCGC TTGTCTGAGC AAGCGAGTAC GCAATTGTTT GAAAAAAGGC 8160

TTGGGATTAG TTTGACACGG TATCAGATTT TACTGTTTTT GCTGGAGCAT TCTCCTTGTA 8220

ACCAAATGGC GGTTCAGGAG CGTTTGAAAA TTGATCAGGC TGCTTTGACA CGGCATTTCA 8280 

AAATTTTGGA AACGGAAGGT TTGGTGGAGC GTCATCGTAA TCCTGAAAAT CAGCGGGAAG 8340

TGTTGGTAGA GGCTGCGAAG TATGCCAAGG AGCAGTTAGT GGTGAATCCC CCTCTGCAAC 8400 

ATATCAGGGT TAAGGAAGAG ATAGAAAGTA TCTTAACAGA GTTTGAGAGA ACAGAACTCA 8460 

GCCGTTTATT AAATAAATTG GTTTTGGGTA TTGAAAATAT AGAAATTTAA GGAGAAATAG 8520

ATGTCAATTA TTTTAACAAC GATCGTTGCT TTGGAGCATT TTTACATTTT TTATTTGGAA 8580 

AGTATTGCCA CGCAATCAGA TGCGACTAGT CGTGTATTTA ATATGGAAAA GGAAGAATTG 8640

GCTCATCCGT CAGTAAGTTC ATTGTTCAAA AATCAAGGAA TTTATAAGGC TCTGCTAGGA 8700 

GTCTTTCTCT TGTATGTCAT TTATTTCTCA CAGAATTTAG AAATTGTGAC TATTTTTGTC 8760 

TTATTTGTGA TTGGTGCTGC GACTTACGGC TCTTTAACAG CGGATAAAAA AATTATTTTG 8820

AAACAAGGTG GATCAGCTAT TTTGGCCTTG ATTAGTATTT TACTCTTTAA ATACACTTGA 8880

AGGTCGATTC TAATCTCGCT AATCCTTTTT AATCCAGAAT AAGGGAAATA TGTTATACTT 8940

GTTTTTAAGA AAAAAGTCTC ATTGAATTGG TTTTGAGGAG TTAGAAATGA AAGTATTAGT 9000 

GACAGGTTTT GAGCCCTTTG GAGGGGAAAA GGGCAATCCA GCTTTGGAGG CCATTAAAGG 9060

TTTACCAGCT GAAATCCATG GTGCTGAGGT CCGTTGGCTA GAGGTGCCGA CAGTTTTTCA 9120

CAAATCTGCT CAAGTATTGG AAGAAGAGAT GAATCGTTAT CAACCTGACT TTGTCCTTTG 9180 

TATTGGGCAA GCTGGTGGAA GAACTAGTTT GACACCTGAA CGAGTGACCA TTAATCAAGA 9240

CGATGCATGC ATTTCTGATA ACGAAGATAA TCAACCGATT GACCGTCCCA TTCGCCCAGA 9300 

TGGTGCTTCG GCCTACTTTA GTAGTTTGCC GATTAAAGCG ATGGTTCAAG CTATAAAAAA 9360

AGAGGGCTTA CCGGCCTCTG TTTCCAATAC GGCAGGGACT TTTGTCTGCA GCCATTTGAT 9420

GTATCAGGCT CTCTATTTGG TAGAAAAGAA ATCTCCATAT GTTAAGGCAG GTTTTATGCA 9480

TATTCCTTAT ATGATGGAAC AGGTGGTGAA CAGACCGACT ACTCCAGCTA TGAGTTTAGT 9540

GGATATTCGG CGAGGGATAG AAGCAGCAAT CGGCGCTATA ATAGAACATG GAGATCAGGA 9600 

ACTCAAGTTG GTAGGCGGAG AAACTCATTG ATAGAAAAAA GCTTGAGGGG AAAAACCTTC 9660

AAGCTTTTGG ACGTTTTCGG GCCAATACTG CTCGGTAAAA CATAATTTTA GTGCATTGGA 9720

TATAAGGTAG GAGTGAAAAA CTAGCAATGC CAAAGGTAAT CCAATTGAGG AAGTACCAAG 9780

GAAGAAGCTG TAAATCTAGG ACAAAGTGCT GGAACTTGTA GCCCTTCATA AAGGAACGGC 9840 

TAGTTTTTAG GATTCGTCTT GGTGGGACCT GTCCTAGGTC TAGACTATAA CAGAGAAGAA 9900 

ATTCCACCTG TGAATAGGCA TAATACTGTG GAATATAGAG GATATTTCCT ACAATGATCA 9960

AGATGAGACT TGCAAGAAAG TAGAGTCCAA AGACCATGAG GAAACGCTCG GTTTCAACTG 10020

ATGAGAGATC TAGATTTGGA AACTCAGGAT GTAGGGTGAC GAATTTTTTG GCTAAAAAGC 10080 

TACTATAAAA GAGGAGGTAA ATCCCAAGTA AATTAGGGAT ACTCCATAAA AAGAGATAGA 10140

AACGTTTGAG AAGTAGGGTC AAAAAGGTTT GAGAAAAGCG CTCCTCATCA AAGAGAGCTA 10200 

GGCTGTTTTT TACAGATGGC TCCGTTTTAG AATCTTTCAT GAGTGTCAGT GTTGCATAGA 10260

CGGAACTGGT CAAAAGAATA GTCCCGATAA AGGAGACTAG TAGAGGAAAG AGGTAGGTTT 10320

GAAGTATTTG GCCAAGTATG CTGAAAAATG GCTGTTCTAA AACAGTCCCG TGGATCCGAG 10380 

ATAAGGGATT AAGAAAACCA GATAAGATGA CCAGCATACT GGGAAGGATA TAGAGGAGAA 10440 

AGAGACGGGG GGTGTCAGCC TGAAAATGTT TTGACTCCTG ACGAATTGTT TTTAAATCAA 10500 

TTTTTGGATA GTTCATTCTC TTATTATACC ATAGTTCTTA TACATAGTTC GTGACAGTTC 10560 

CTACTTTTTT TGATAAAATC ATACAGTGTG TCCTTGGGCA CACTGTATGA ACTGGGACTG 10620

TCTTTCCCAG CTTCGGAGGT AAAAAATGTC AGATTCACCA ATCAAATATC GTTTGATTAA 10680

GAAAGAAAAA CACACAGGAG CTCGTCTGGG AGAAATCATC ACTCCCCACG GTACCTTTCC 10740 

GACACCTATG TTTATGCCAG TTGGGACACA AGCCACTGTC AAAACTCAGT CACCTGAAGA 10800 

ATTGAAGGAG ATGGGTTCGG GAATTATCCT ATCAAACACC TATCATCTCT GGCTTCGCCC 10860

TGGAGATGAA CTCATTGCAC GCGCTGGTGG TCTCCACAAG TTCATGAATT GGGACCAGCC 10920 

TATCTTGACA GATAGTGGTG GTTTTCAGGT TTATTCTTTA GCAGATAGCC GTAATATCAC 10980 

AGAAGAAGGA GTAACCTTTA AAAATCATCT AAATGGTTCT AAGATGTTCC TATCCCCAGA 11040 

AAAAGCCATC TCTATTCAGA ATAATCTGGG TTCAGACATC ATGATGTCCT TTGATGAATG 11100 

TCCTCAGTTT TATCAACCTT ATGACTACGT TAAGAAATCG ATCGAGCGTA CCAGCCGTTG 11160 

GGCTGAGCGT GGTTTGAAGG CTCACCGTCG TCCACATGAC CAAGGTTTGT TTGGAATTGT 11220

GCAAGGTGCA GGATTTGAAG ACCTTCGCCG CCAATCAGCT CATGATCTTG TCAGCATGGA 11280

TTTCTCAGGC TACTCTATCG GTGGTTTGGC AGTGGGAGAA ACCCATGAAG AGATGAATGC 11340

GGTCTTGGAC TTTACAACTC AACTGCTGCC TGAAAATAAA CCTCGTTATC TGATGGGTGT 11400

GGGAGCGCCA GATAGCTTGA TCGATGGGGT CATTCGTGGG GTGGATATGT TTGACTGTGT 11460

CTTACCGACT CGAATTGCTC GTAACGGGAC TTGTATGACC AGTCAAGGAC GTTTGGTTGT 11520

GAAAAATGCC CAGTTTGCTG AGGACTTTAC GCCACTGGAT CCTGAGTGTG ATTGCTACAC 11580 

ATGTAATAAC TATACACGCG CTTACCTTCG TCACCTGCTC AAGGCTGATG AAACCTTTGG 11640 

TATCCGCTTG ACTAGCTACC ACAATCTTTA CTTCTTGCTT AACCTGATGA AGCAAGTGCG 11700 

ACAAGCCATC ATGGATGACA ATCTCTTGGA ATTCCGTGAG TATTTTGTGG AAAAATATGG 11760

CTATAATAAG TCAGGACGTA ATTTCTAAAA TGGAATTGAT ATAAAAAAAT CCTAAGTTTT 11820

CTCTTAGGAT TTTTCTTCTT TTTTTGATAG AATAAAGTGT ACAATGAAAG GAAGAATAAA 11880

CTCGTATGCG CATTAAATGG TTTTCCTCGA TTAGG 11915 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9069 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 
GAGAGGGCAA CAGTTCTATC GCTTCAAATT TTTTCTTGGT TTGCAGATAT TCAAGAATCG 60

GGAGTTTTTC TATAGTATTC GGCAGATTTA TTACAGCCAA GCATCTCAAA AATACGGACA 120

GCATCCTCCA TCTTTTTCTG GCCTTCCTTG ACTCTACCTT GCTTGCTATC AAGGAGACCT 180

TCTGCCCACA GATAAACAAT TCGGAAATAG GTCTCATTTT CCTTGTAGAA ATGCTCTTCG 240 

ATAACACGTT TAAAATAATA GGCATTGGTA AATTCTTCAC ACTCAATACT AGCTAAAAAG 300 

CCATTCAATA GTATAGTATG AAAAAGGTTT CGATTGCCAG ACATTTCCAT TAGAAAATCA 360

GATTTACGTA CCATTTCTCG TACATATCTA GTAAAAAGAG AAACAGATAA AAATGGAGAA 420 

CTGACTGAAA ATAAATTGAG TTCATAGATT CCCCAGATCT CGGTAGAAAA CAAATAATCA 480 

TGAAGGACTT TTCCTTCCTC TGCTGTTAAG TCTACCCTTT CATCTATGCT CTTCATATAA 540 

GACTTGATAA TAATGGCATT TAGAATATGT TTCTGTTTGT TGTGAGAATG GGCATGCTTT 600 

TATACTCCCT GCGATATAAG TCCTCAAGAG GTGCTATATT CTTTGGTTCC AAGACATCTG 660 

TAATTTCTTT TCTCAACTCA GAATCTGTAT CATACTGGAA ACCTCTTGCC AGAAAGAGGA 720 

TCTCCTCCAC ACTGGCAGAT ATATTTTCCA GAGCAAATAG AAACTTTTCC ACCGAAAGCT 780 

CACTCTGACC TGTTTCAAAA CGGGACAACA TAGACGGCGA AAATTGTCCT CCGGTTGCTT 840 

GTCTCAGTGA GATATTTCTT GACTCTCGTA ATTGTCTAAA GACTTTTCCA ATCTGCTCCA 900

TAGACTTCCC CTTGATTCCG TATTTTCTTC ATTTTATCAT ATTTTTCAGA AAATTCATCA 960

AAAACTTGCC AAATTGTCAG AATTATGAGA AAATAGAGGA TATTTATCAC GTGGAGGGAC 1020 

TGCTATGAGA GACGATATCA AAATCAATGA CCGTGCTTTG GCCTTGCAAG ACCAAATTAT 1080

CGAAAAACTA GAGAAAGTTT TTGATACAGA TGTGGAATTG GATGTTTACA ATCTAGGTCT 1140 

GATTTATGAA ATCAATCTGG ATGAAACGGG GCTCTGCAAG ATTGTCATGA CCTTCACCGA 1200

TACTGCCTGT GATTGCGCCG AAAGCCTGCC TATTGAAATC GTGGCAGGTC TGAAACAAAT 1260

CGAGGGTATC AAAGATATCA AGGTTGAAGT TACCTGGTCG CCTGCTTGGA AAATCACACG 1320 

AATCAGTCGC TATGGCCGTA TTGCCCTTGG ACTACCACCT CGTTAAGCAG ACCAATCACT 1380

TTTAAAGATG AAAATCAAAG GGCAAACTAG AAAACTAGCC GCAGGTTGCT CAAAACACTG 1440 

TTTTGAAGTT ATGGATAGAA CTGACGAAGT CAGCTCAAAA CACTGTTTTG AGGTTGTGGA 1500 

TAGAACTGAC GAAGTCAgCT CAAAACACTG TTTTGAGGTT GTGGATAGAA CTGACGAAGT 1560 

CAGCCCAAAA CACTGTTTTG AGGTTGTGGA TAGAACTGAC GAAGTCAGTA ACCATACCTA 1620

CGGCAAGGCG ACGTTGACGT GATTTGAAGA GATTTTCGAG TATGAGTTTA TTTTTTACCT 1680 

GACTTGTCCA TATTCCAGAA GTCTGTCACG GCTCCGCGTG AAGCAGATGA TACGATGTGG 1740

GCATATTTAC CGAGGACACC ACGGCTGTAA AGTGGTGGCA AGGTTGTTTC TGCCTTGCGT 1800 

TTTTCAAGTT CTTCTTCGGA TACGGCCATA GAAATTTCTT TGGTATCTTG GTCAACCGTA 1860

ACGATATCGC CGGTACGGAG ATAGGCAATT GGTCCACCAT CCTGAGCTTC AGGAGCGATA 1920

TGTCCAACAA CCAGACCATA AGTACCACCA GAGAAACGTC CGTCCGTCAA GAGGGCCACC 1980

TTATCTCCCT GACCTTTACC AACAATCATT GAAGAAAGTG ATAGCATCTC AGGCATACCA 2040 

GGACCACCTT TAGGTCCAAC AAAACGAACA ACGACTACAT CGCCATCAAC GATTTCATCT 2100 

GTCAGAACGG CCTGAATCGC ATCTTCTTCT GAGTCAAAGA CCTTAGCTGG CCCAACGTGA 2160

CGACGCACTT TAACACCTGA TACCTTGGCA ACTGCACCGT CAGGAGCAAG GTTCCCGTTC 2220

AAGATGATAA GCGGACCATC CGCACGTTTT GGATTTTCAA GTGGCATGAT AACTTTTTGG 2280 

CCTGGAGTCA AGTCTGCAAA GTCAGCCAAG TTTTCAGCTA CAGTCTTACC AGTACATGTG 2340

ATGCGATCTC CGTGAAGGAA ACCATTTGCC AACAAATACT TCATAACCGC AGGGACACCA 2400 

CCGACTTCGT AGAGGTCTTG GAAGACATAC TGACCAGATG GTTTCAAGTC GGCCAAGTGA 2460 

GGCACACGTT CTTGAATCGT ATTGAAGTCC TCAAGTGACA AGTCAACATT TGCGGCATGG 2520

GCAATGGCGA GCAAGTGAAG AGTGGCGTTT GTAGAACCAC CGAGAGCCAT CGTTACAGTG 2580

ATAGCATCTT CAAAGGCTTC ACGAGTCAAG ATATCTGATG GTTTGAGACC AAGTTCCAAC 2640

ATCTTAACAA CAGCACGTCC TGCTGCTTCG ATATCTTCTT TCTTATCAGC TGATTCAGCT 2700 

GGGTGAGAGG ATGACCCTGG CAAACTCATC CCTAGAACTT CGATAGCAGT TGCCATGGTA 2760 

TTAGCAGTAT ACATACCACC ACAACCACCA GGGCCAGGGC AGGCATTACA TTCAAGACGT 2820

TTCACGTCCT CAGCTGTCAT GTCACCGTGG TTCCATTTTC CGATACCTTC AAAGACAGAA 2880

ACCAAGTCGA TATCTTTACC ATCAAGATTT CCCGGTGCAA TAGTTCCACC ATAGGCGAAA 2940 

ATAGCTGGGA TATCCATATT AGCAATAGCA ATCATAGATC CAGGCATGTT CTTGTCACAG 3000

CCACCGATAG CGACGAAGGC ATCCACGTTG TGACCACTCA TAGCCGCCTC GATGGAGTCC 3060 

GCGATGATGT CACGAGATGT TAGAGAGAAA CGCATACCAG GCGTTCCCAT AGCGATCCCG 3120 

TCCGCTACGG TAATGGTTCC AAACTGTACA GGCCAAGCGC CTGCAGATTT GACACCTTCT 3180 

TTAGCCAGTT TCCCGAAATC ATGCAAGTGA ATGTTACATG GTGTATTTTC CGCCCAAGTC 3240 

GAAATCACTC CCACAATCGA TGTTTCAAAG TCCTTATCTG TCATACCAGT CGCACGAAGC 3300

ATAGCACGGT TAGGTGATTT AACCATGCTG TCATAAATGC TACTGCGGTG ACGTTTATCT 3360 

AATTCAGTCA TCTTATCCCT CCCATTTCAG TTTTTACTAT TATAGCACAA TTTTCGCATG 3420 

AAGAACAGAA TAAAATTCTT GAATTTTCAG AAAATTCTAT ACACATGTGA AATATTTAAA 3480 

ATTAAAAACA ACAAAGCGGA TTAGTGCACT TTCTGATGAC CAGAATATGC TTTTTAATCC 3540 

GCTTTCTTTA AATAACGTAC TGTAATTTTT ACAGAAATTC TTTCAAATAA GTGTATTTAA 3600

CATCTATCTT GCATTATAAA TTTCTAGAAC CTTCTCTTTT ATATTCGATT CACTCAAACC 3660



WO 98/18931 



PCT/US97/19588 



ATACTCATTA AGAAGATAAT CCATTTTCCC TACTTGACCG AATCTTTCTT GAACACCCAT 3720

CCGATGAATT TTTGTTATTC CATCATCAGA GAATAATTCA CATAAAGCAC TGCCAATTCC 3780

ACCTATCTGA TTGTGGTTTT CTACAGTAAA TATAGTTTTT CCACTTAACA TTGTTTTTAT 3840

CTGTTCTGGT ATCGGTTTGA TTCTAAATAA ATCTATCACA CCTACTGAAT AACCTAATTT 3900 

AGACAGTTCA TCTGCAACTC GAATACTTGG AGCAACCATT ATGCCAGAAG CAACGATTAC 3960 

AAGATCTTCA CCATGCCTTA ACTCAATGTA GCCTTTAGAA AAATCTTCTC CACCTTGATA 4020

CACAGGAACT GGAGCTTTTC TAATTGTTCG AATATATTTT AGTCCTTTTA AGTCTAATGT 4080 

CTGGTTCAAT ATTTCACGAA ATTGGATATC ATCAGTTGCT TCGAAAATGA TTGATTTAGG 4140

AATTAAACGT AACAATCCAA TTTCTTCAAA TGGCATATGT GTTCCACCAT TCATCTCTGC 4200

CGTTACTCCT GCATCTGATC CAATCACAGT GGCATCCAAT TGTGCGTATC CAAGAGAAAT 4260

AAATAATTGA TCAAATACTC TTCGTGAAGC AAAAGGACCA AATGTATGAA GATAAGGTCT 4320

AAACCCCTGA ATAGACAAGC CTGCTGCAAG GCCGACCATT TCTGCTTCCA TAATCCCAAC 4380

ATTCACATAA CGGTCTCCAA AGTCCTTTTC AAGATTATTA GTAGCCATCG AACTTGACAA 4440

ATCGGCTTCT AAGACTACTA TATCAGAATC ACTTTGATTA GCCTCTAAAA GGAAGTCTCT 4500 

ATATACATGC CGTAATTCTT TCGTACTTCT CATCATTCTG TTTCCTCCAA TTCCTGACTT 4560 

AATCTTTCTA CAACTGAAGT TAACATTTGT TTCTCCTCTA CAGTAGGGCG AAGATGATGA 4620

TTGGATTTCA TTTCTTCCAG CTCTTGAACC CCTTGACCTT TAATAGTATC TAATACAATG 4680 

CACTTAGGTG ATGAATTATT TGACTGTTTT AATTGGACAA TCCCTTCATA AATTTCTCTA 4740

ATATCTGAAC CCTTGACCCT AATGGATTCA AATCCAAATG CTGAAAATTT TTCTACGAAA 4800

TCACCTGGAT TACAAATATC CTTTGTAAAA CCATCTAATT GTTTTTTGTT ATCATCAACA 4860

AATACAATTA AGTTGGATAA CTGTTGATGA GAAGCAAACT GTATAGCCTC CCAACATTGT 4920

CCCTCATTTA ACTCACCATC TCCAACAATA GCGTAAGTAT AAAAGGGACT CTTTCTTATT 4980

CTCTGACCAT ATGCAAGTCC AGTTGCAACA CTAATTCCTT GTCCTAAAGA GCCCGTTGTC 5040

ATATCTATGC CTGGCGTTAG ATTTCTATCA GGATGAGACG GTAATTTGGT TCCATTTGTA 5100 

TTTAAAGAAT ATAAGAATTC TTTGTCAAAG AAACCATTCA AATAGAGTGT ACTGTATAGA 5160 

GCTGGTCCTC CGTGACCTTT TGATAATATG AAATAATCTC TATCTCGTGC TGCAAATATT 5220

TCTGGAGTCA TTGGCATTAT TTCACCATAA AGCACCGCTA AAACTTCTAC GATAGACAGA 5280 

CTTCCTCCGT AATGTCCGAA TCCAAGATGA TTCAATGTTC TAAGAGTATT TAATCGGATG 5340

TTAGTCGCAA ATTTTCTTAA CCCATCTTCT CTATTTTTAC TTAAAATCAT CCCTTATTCC 5400

TCCGTTGCAG ATGGCTTTTT AATAAAGGAT ACTCCAAACA TAACTGCTAG AATAAGAACA 5460

AGACCAATCA CAATGCCTGC TTGTGAGCCA AATTGATTTA ACATTCCTAA AATAATTCCT 5520

GATAGACCAA AATCTGCATC TGAGAAAGTT GATCCTTGGA AACCAAGTCC TCCCAAAACT 5580 

GGCATTAAAA AGACTGGAAG AAAACTGATT AAAATACCTT GTAAAAATGC TCCAATAGTG 5640

GCTCCACGAA CACCACCAGA TGCATTCCCA ATGACACCTG CAGTCGCTCC ACAGAAGAAA 5700 

TGAGGCACAA CACCTGGTAA GATAACAACC GTTCCTGAAG CAATCATAAT TACCATACTT 5760 

ACTAAACCAC CAACAAAACT AGAGATAAAT CCAATTAGAA CTGCATTGGG TGCATAAGTA 5820

TAAACAATCG GACAATCCAA AGCAGGTTTT GAATTAGGTA CAAGACGCTC TGAAATACCT 5880

TTAAAGGCTG GAACAATTTC GCCCAAAATA AGGCGAACAC CTGCTAAAAT AACAAATACC 5940 

CCTGCTGCAA ATTGACCTGC TAATTGTAAA GCATAAACTA GACCACTTGT ACCACTACTG 6000

ATTTCTTTTT CTATATATTC TGACCCTGCA AAGATAGCTA CAATAATGTA AATAACTGCC 6060 

ATGGATAAAG TAATACTAAC AGTACTATCA CGTAAAAAAG CTAAACTCTT TGGAAATTTA 6120 

ATGTCCTCTG TTGATTTTGA TTTGTCACCG ATAAGGCTAC CAGTAAAACC ACTCAACCAA 6180

TATCCCAAAG AACTGAAATG ACCTAAAGCT ACCTTGTCAT TTCCAGTTAA TTGAACCATA 6240 

TATTTTTGCA CAAATGCTGG GGAAATACTC ATAATAATAC CGAGTGCTAA TCCTCCTAGT 6300

AAGATGAGAG GCAAGCTAGT AAAGCCAGCA ACTGATAAAA TGACCGCAAT CATACATGCC 6360 

ATATATAGAG TGTGGTGCCC TGTTAAAAAA ATATATTTAA ATCGAGTAAA ACGAGCGATT 6420 

AAGATATTGA ACACCATGCC TGCAAACATA ATCATTGCAG TAGCTGAGCC ATATGTTGTT 6480 

AAAGCTACAG CTACAATTGC TTCATTATTC GGCACAACGC CAGATAAATG AAAAGCATGC 6540

TCAAACATGG TACCAAATGG ATTTAAAGAA TTTTGTACAA TTCCTGCACC ACCAGATACA 6600 

ACTAAGAAAC CAACAAAGGT CTTAATTCCA CCTTTAATAA TATCAGGTAA TTTCTTCTTC 6660 

TGAAGAACTA ATCCTAAGAT TGCAATTAAA GCTACTAAAA TAGCTGGTGT ACTAACAATA 6720

TCCAATATGA ACTTCATCAT GACGCTAGCC TCCTATATAA GTCCTTTTTC TTCACAAAGT 6780 

TTAGTAATTA ATTCTCGTAG TTCATCCATA TCAATAATAC TATTTAAGAT ACGAACATCT 6840

CCAAGATGAC TAGCTGAATC AGCTAGATCA CGACCAACAA TCCAAATATC AGCTGCATTT 6900 

GGATCTGCTC CACCTAAATC ATAATGTTCA ACTTCTACAT CCGAAACATT CAAATCACTC 6960 

AATACAGATT CAATATTCAT CTGTACCATA AAACTTGAAC CTAATCCTGA ACCACAAGCT 7020 

GTACCAATTT TTAACATTAT CTAATCCTCC TGTTTAATTA TCATTTTAAT GTCATCATAG 7080 

TTTTTTGATG ATATTAAAGT TTGAACATGA TTTTTATCTC TTAAAATTGT TGTTAAATGT 7140 

GACAAAGCCT TTAAATGACT CTCATTATCA ATGGCTGCAA TACAAATCAA CAATCTTACC 7200

TCTTGTTCTG GATTATCCAA TAAATAAATC GGTTCTTCCA AAACTAACAT TGACATTCCT 7260

ATTTCATTCA CACCTTCATC TGGCCGAGCG TGAGGAATTG CTACTCCCTT CCCTAAATTA 7320

ATAAAAGGTC CAAACTCTTC TACTTTTTGA ATCATTGCCT CAGGGTAGTT CTCAGTTATC 7380

TTATCTTGAT CCAAAAGCGG TTTAGCTGCT AAACGAATCG CCTCCTTCCA TCCTAATTTT 7440

TGCGAACTAA CCTGATAGGT TTCTTTGGTA ATAAGTTGTT CTAGCACTGG TACAATTTCC 7500

TTTCTATCAT TTTTTTGGTA AAGATAATTC TTTAACGCCA ATCTTAATTC CAATTCTTGT 7560

GTAATAATTC CATATCTTTT GACAATATTC AGGATTTGTT CAATCTCAAA ATCTCCATAC 7620

TCTAAATTCG GAAAATCTTT TAACACTAGT TCTACTAGTT GTATTGCTTG CTCTTCAGTC 7680

ATCATAACCG AAACTAGATA ATTTGGCTTT TCTGTCTCCA CCTTTATGGT AGAAAAAACC 7740

ATATCATAGT CACTACTAGC TTTCACCTGT AAATCATCAA TCTTTGAGGT TCCTATAAAC 7800

TCAATTTGAG GAAATAATGC TAATAGATTC TCTTTTAACA TCAATGAAGA ACTAACACCA 7860

TTAGGACAAA TGATTGCTGC TTTATACCAT TTTTGAGGCA AAGTATCTGC TTTCTTTAAA 7920

TAACCTCCGA AATGGATAAC AAAATATGCT GTTTCACTAT CAGGTATGGG ATTGTCAATA 7980

GCGTCCATCA AGGGCATCAA AGAATCTTTG ACTAATTCAA ATAAATCAGG ATAATGTTCT 8040

TTAACATGCA ATACATATTC ATTTGAACTA GGTAGGCCGA ACTTTAATCT ATAGTAAGCC 8100

GGTATAAGGT GGCGGCGAAG ATTTTCTCTC AATCCTTCCC TTTGTTTAAA ATGTAACAAA 8160

GAAATATCTT CCATTCTACT TATAATAGCC TCTGTTAATT GATTAAAGTA AACCGGAGCA 8220

ACATCTACTT CACCTTCAAA GCAACTTGAT AATAAAACGG TGATATAGCG ATAATCATCC 8280

TCAGAAAACA CCGTATCTAT AATTCCCAAA TCAACCACTG TATCCAATAA AATAGTGGTT 8340

ATATCTTGAA TAACAGGAGA TACTAATGTC TCTGAAAGAC ATACTCTTTC AACATCCCTT 8400

TGATACCTAC ACAGAATGAA TACTAAACCG AAAAGGTAAA CTTTTAATTG ATTAACAATA 8460

GGTACTAGCT GTAGCTTCTC ATAATAATCT TTAACTACCT GATCAATCAA ATCATAAGTT 8520

AATGAATACC CCCAACTGGA TAAAACATAA TCCAAACCCC AAATCCCTAT GGAGGATTCC 8580

AGCAACTCAC TAACCATTTG AAAAGCTAAG CGGTGCTTAT TCCACTCTGA ACCGTGTAAA 8640

GTATAACCTT TTGCTCTACT GTACCCTAGC TCCAAATCAT TATCTAACAT AATCTTTCTT 8700

AATGATTGAA TATCAGATAA GGTTGTATTC TTACTTACTT TCAAAAAGTC TTGGTAATGA 8760

CTATTCGATA TAAAATCTAA TCGGCAAAAA GTGTAAAGAT AGATTAAAGC TAAGCGAGTC 8820

GACTTTGGTA AAACCAATTC ATCCGACTTA ATAATATCTG TCAAAGACTG CTTCGTACGA 8880

TTTGATAAAC TATAGCGACC TTGCTTTTTA TCCAGCACTA TCCCTTTATT AGCTAGATAA 8940

GGCACTAAAT AATCTATTCC TTCTTTGACT TCCTTTATAG GTAAGCTCAC CTTAACAGAT 9000

AATTCATATA ACGATAGCTC ACAATGATCC ATCAAAGTCA TCAAAATAAC TAGTGCTCTA 9060

TAATCAAAC 9069
(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8654 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

CGAGACAACA AGATGAAGAA AAATTTGCCC TATCGTTTGT GGCGCTTGCA AGTGTAGCAC 60 

TTCTTGCAGC CTGTGGAGAA GTGAAGTCTG GAGCAGTCAA CACTGCTGGT AACTCAGTAG 120 

AGGAAAAGAC AATTAAAATC GGGTTTAACT TTGAAGAATC AGGTTCTTTA GCTGCATACG 180 

GAACAGCTGA ACAAAAAGGT GCCCAATTGG CTGTTGATGA AATCAATGCC GCAGTGGTAT 240

CGATGGAAAA CAAATCGAAG TAGTCGATAA AGATAATAAG TCTGAAACAG CTGAGGCTGC 300

TTCAGTTACA ACTAACCTTG TAACCCAATC TAAAGTATCA GCAGTCGTAG GACCTGCGAC 360

ATCTGGTGCG ACTGCAGCTG CGGTAGCGAA CGCTACAAAA GCAGGTGTTC CATTGATCTC 420 

ACCAAGTGCG ACTCAAGATG GATTGACTAA AGGTCAAGAT TACCTCTTTA TTGGAACTTT 480

CCAAGATAGC TTCCAAGGAA AAATTATCTC AAACTATGTT TCTGAAAAAT TAAATGCTAA 540 

GAAAGTTGTT CTTTACACTG ACAATGCCAG TGACTATGCT AAAGGGATTG CAAAATCTTT 600

CCGCGAGTCA TACAAGGGTG AAATCGTTGC AGATGAAACT TTCGTAGCAG GTGACACAGA 660

CTTCCAAGCA GCCCTTACAA AAATGAAAGG GAAAGACTTT GATGCTATCG TTGTTCCTGG 720 

TTACTATAAT GAGGCTGGTA AAATTGTAAA CCAAGCGCGT GGCATGGGAA TTGACAAACC 780

AATCGTTGGT GGTGATGGAT TCAACGGTGA GGAGTTTGTA CAACAAGCAA CTGCTGAAAA 840 

AGCATCAAAC ATCTACTTTA TCTCAGGCTT CTCAACTACT GTAGAAGTTT CAGCTAAAGC 900 

TAAAGCCTTC CTTGACGCTT ACCGTGCTAA GTACAATGAA GAGCCTTCAA CATTTGCAGC 960 

CTTGGCTTAT GATTCAGTTC ACCTTGTAGC AAACGCAGCA AAAGGTGCTA AAAATTCAGG 1020 

TGAAATCAAG AATAACCTTG CTAAAACAAA AGATTTTGAA GGTGTAACTG GTCAAACAAG 1080 

CTTCGATGCA GACCACAACA CAGTCAAAAC TGCTTACATG ATGACCATGA ACAATGGTAA 1140

AGTTGAAGCA GCAGAAGTTG TAAAACCATA ATAGAAAAAT GTTGAAATAG GGAATGAGCC 1200

TTTGACTCAC TCCCTGTTTC GATATTTAAT ACTCTTCGAA AATCTCTTCA AACTGCGTCA 1260

ACGTCGCCTT GGATTATATA TGTGACTGAC TTCGTCAGTC TTATCTACAA CCTCAAAGCA 1320

GTGCTTTGAG CAACCTGCGG CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATAA 1380

GAACCTATCA AAAAGTGAGG GAAAACCCTC GGAATTATAA ATAGAAAGAG TGAATCTTAT 1440 

GCTCCAACAA CTCGTAAATG GTTTGATTCT AGGTAGTGTT TACGCGCTGT TAGCCCTAGG 1500 

ATATACCATG GTTTACGGAA TTATCAAGCT CATCAACTTC GCCCATGGTG ATATTTATAT 1560 

GATGGGAGCC TTTATCGGTT ATTTCTTGAT CAATTCTTTC CAAATGAATT TCTTTGTAGC 1620 

GCTTATTGTA GCTATGCTAG CGACAGCTAT TCTTGGTGTC GTGATTGAGT TTCTTGCTTA 1680 

CCGACCTTTG CGCCACTCTA CTCGTATTGC TGTTTTGATT ACGGCTATTG GGGTTTCTTT 1740 

CCTATTGGAG TATGGAATGG TCTATCTGGT TGGTGCCAAT ACCCGTGCCT TCCCTCAAGC 1800 

GATTCAAACA GTTCGATATG ATTTGGGACC AATTAGCTTA ACAAATGTGC AGTTAATGAT 1860 

TTTGGCCATT TCCTTGATTT TGATGATTTT GTTACAAGTC ATTGTCCAAA AGACTAAGAT 1920

GGGGAAAGCC ATGCGTGCAG TATCAGTAGA TAGCGACGCG GCGCAATTGA TGGGGATCAA 1980 

TGTAAACCGT ACGATTAGCT TTACCTTCGC TTTGGGTTCT GCTCTTGCGG GTGCGGCTGG 2040

TGTTCTGATT GCTCTTTATT ATAACTCTCT TGAGCCTTTG ATGGGGGTTA CTCCAGGTCT 2100 

TAAATCTTTC GTTGCCGCAG TACTTGGTGG TATCGGAATT ATTCCTGGTG CGGCTCTTGG 2160 

TGGCTTTGTG ATTGGTCTAT TGGAAACCTT TGCGACTGCC TTTGGGATGT CAGATTTCCG 2220

TGATGCCATT GTTTATGGAA TCTTGTTGTT GATCTTGATT GTCCGCCCAG CTGGTATCCT 2280 

TGGTAAGAAT GTGAAAGAGA AGGTGTAAAC GATGAAGGAA AATTTAAAAG TTAATATTCT 2340

ATGGTTACTC CTTTTGTTAG CTGGCTATAG CTTGATTAGT GTACTGGTTT CAGTCGGAGT 2400

ACTTAATCTA TTCTATGTAC AGATTTTACA ACAAATTGGA ATTAATATTA TTTTGGCTGT 2460

TGGTCTCAAC TTAATCGTTG GTTTTTCAGG ACAATTTTCA CTTGGTCATG CTGGTTTCAT 2520

GGCGATTGGT GCCTATGCAG CAGCTATTAT TGGTTCTAAA TCACCAACCT ACGGTGCCTT 2580

CTTTGGAGCT ATGCTTGTAG GGGCTTTGCT TTCAGGAGCA GTTGCCTTAC TTGTCGGCAT 2640

TCCAACCTTG CGCTTGAAGG GGGACTATCT TGCGGTAGCA ACTCTGGGTG TTTCTGAAAT 2700

TATCCGTATC TTTATCATCA ATGGTGGAAG CCTTACAAAT GGTGCGGCAG GTATCTTAGG 2760

GATTCCTAAC TTTACAACTT GGCAAATGGT TTACTTCTTT GTCGTGATTA CAACCATTGC 2820

AACCTTGAAC TTCTTGCGTA GCCCAATTGG TCGTTCAACC CTCTCTGTTC GTGAAGATGA 2880

AATCGCTGCT GAGTCAGTTG GGGTTAATAC GACTAAAATT AAAATCATCG CTTTTGTCTT 2940

TGGTGCCATT ACTGCAAGTA TTGCTGGGTC ACTTCAGGCA GGATTTATCG GGTCTGTTGT 3000

ACCGAAAGAT TACACCTTCA TCAACTCAAT CAACGTTTTG ATTATTGTTG TATTTGGTGG 3060

ACTCGGTTCC ATTACAGGTG CGATTGTTTC GGCTATTGTT CTGGGAATTT TGAATATGCT 3120 

TCTCCAAGAT GTTGCTAGTG TGCGTATGAT TATTTACGCT TTGGCCTTGG TATTGGTAAT 3180 

GATTTTCAGA CCAGGTGGAC TCCTTGGAAC ATGGGAACTG AGCCTATCAC GTTTCTTTAA 3240

AAAATCTAAG AAGGAGGAAC AAAACTAATG GCATTACTTG AAGTAAAACA GTTAACCAAA 3300

CATTTTGGTG GTCTAACAGC TGTTGGAGAT GTGACTCTTG AATTGAACGA AGGGGAACTG 3360

GTTGGATTAA TCGGTCCAAA CGGAGCTGGG AAAACCACCC TTTTCAACCT TTTGACCGGT 3420 

GTTTATGAAC CAAGCGAGGG AACAGTAACC CTAGATGGTC ACCTTTTGAA TGGGAAATCA 3480 

CCTTATAAGA TTGCCTCTTT GGGACTTGGA CGTACTTTCC AAAATATCCG TCTCTTTAAA 3540

GATTTAACAG TTTTAGATAA TGTTTTGATT GCTTTTGGAA ACCATCACAA ACAGCATGTT 3600 

TTTACTAGTT TCTTACGCTT ACCAGCTTTT TACAAGAGTG AAAAAGAATT AAAGGCTAAA 3660

GCTTTGGAAT TGTTGAAAAT CTTTGATTTA GATGGTGATG CAGAGACTCT TGCTAAAAAT 3720 

CTTTCCTACG GACAACAACG TCGTTTGGAA ATTGTTCGTG CCCTTGCTAC GGAACCTAAA 3780

ATTCTCTTCT TAGATGAACC AGCAGCAGGT ATGAACCCAC AGGAAACAGC CGAATTGACT 3840

GAGTTAATTC GTCGTATCAA AGATGAGTTT AAGATTACAA TCATGTTGAT TGAACACGAT 3900

ATGAATCTGG TCATGGAAGT AACAGAACGT ATCTACGTAC TTGAATATGG CCGTTTAATC 3960

GCTCAAGGAA CTCCAGACGA AATTAAGACC AATAAACGCG TTATCGAAGC TTATCTAGGA 4020

GGTGAAGCCT AATGTCTATG TTAAAAGTTG AAAATCTTTC TGTGCATTAC GGTATGATCC 4080 

AAGCAGTTCG TGATGTAAGC TTTGAAGTTA ATGAAGGAGA AGTTGTTTCC CTTATCGGTG 4140 

CCAACGGTGC AGGTAAGACA ACTATTCTTC GCACCTTGTC AGGTTTGGTT CGACCAAGTT 4200

CAGGAAAGAT TGAATTTTTA GGTCAAGAAA TCCAAAAAAT GCCAGCTCAG AAAATCGTGG 4260

CAAGTGGTCT TTCACAAGTT CCAGAAGGAC GCCACGTCTT TCCTGGCTTG ACTGTTATGG 4320

AAAATCTTGA AATGGGAGCT TTCTTAAAGA AAAATCGTGA AGAAAATCAA GCTAACTTGA 4380

AGAAGGTTTT CTCACGCTTT CCTCGTCTTG AAGAACGGAA GAACCAAGAT GCAGCCACTC 4440

TTTCAGGGGG GGAACAACAA ATGCTTGCCA TGGGACGCGC CCTCATGTCA ACACCAAAAC 4500

TTCTTCTTTT AGATGAACCA TCAATGGGAC TTGCCCCAAT CTTTATCCAA GAAATTTTTG 4560 

ATATCATTCA AGATATTCAG AAGCAAGGAA CAACGGTCCT CTTGATTGAA CAAAATGCCA 4620

ATAAAGCACT TGCAATCTCT GACCGAGGAT ATGTACTGGA AACAGGGAGA ATCGTCCTAT 4680

CAGGAACAGG AAAAGAACTC GCTTCATCAG AAGAAGTCAG AAAAGCATAT CTAGGTGGCT 4740

AAAACAATCC AGTGGATTGT TTTAGTCGGC AGATGGAGAT TACGAAGTAA TCATCAATAT 4800

AGTCCGGGGG ACCTTTTTAG TCGGTAGATT GAGATTGCAA ACAAATCTGC ATCTACATTG 4860

AAAGCTTAAT TTCTAATAAT TGAAAAAATC GAATGAAAAA TTTCTTACCT TCATTCACAG 4920 

AGCTCGATTT CAGAGCTCTT TTTGCTAGCT TATTCATACT TTTCTGAATT TCGAAAAAGA 4980

AATGTAAGCG TTTGATAGAT TTACAAAAAG ATTGTATAAT AGGGATAAGA ATAGAAAAGG 5040

AGAAGTCTCA TGGCAGTTAA AGATTTTATG ACCCGCAAGG TAGTTTATAT TAGTCCAGAT 5100 

ATAACAGTAT CTCATGCAGC AGATTTGATG AGAGAGCAAG GTTTGCACCG TCTGCCTGTT 5160 

ATCGAAAATG ATCAATTAGT TGGTTTGGTG ACTGAGGGAA CCATTGCACA AGCAAGTCCA 5220

TCTAAAGCAA CAAGTCTTTC TATCTATGAG ATGAATTATC TTCTGAATAA GACAAAAGTA 5280

AAAGATGTCA TGATTCGCGA TGTTGTCACT GTCTCAGGCT ATGCTAGTCT AGAAGATGCA 5340

ACTTATCTGA TGTTGAAAAA TAAGATTAGT ATTCTCCCTG TCGTAGATAA CCATCAAGTA 5400 

TACGGAGTTA TTACTGACCG TGACGTTTTC CAAGCCTTTC TTGAAATTGC AGGTTATGGC 5460

GAAGAAGGGA TTCGTGTACG CTTTGTTACA GAAGATGAAG TTGGTGTTCT TGGAAAAATT 5520

GTTTCTTTGA TTGTAGAAGA AAATTTGAAT ATCTCCCATA CAGTCAATAT TCCGCGTAAG 5580

GATGGTAAGG TGATTATCGA AGTGCAAATC GATGGATCAA TTGATTTACC AGCCTTGAAA 5640

GAAAAATTTG AAGCAAATGG TATTCAAGTG GAAGAAATCG CTCGCACTTC AGCAAAAGTC 5700

TTGTAAGAAG GGAAGCCCAA AGGCTTCTTT TTTCATGAAA AGGGGATTAG AGCAAAAGAT 5760

GGAAAGAAAT GATAAAATAT GCTATAATGA AATAATGTAA AAAAGGAGTA TTTATGGACA 5820

TTTCAGTAAT TCGTCAGAAA ATTGACGCAA ATCGTGAAAA ATTAGCTTCT TTCAGGGGGT 5880 

CTCTTTGACC TCGAAGGGCT AGAGGAAGAG ATTGCCATCT TGGAAAACAA GATGACAGAA 5940

CCTGATTTTT GGAACGATAA TATTGCGGCC CAAAAAACGT CGCAAGAATT AAATGAATTA 6000

AAAAACACTT ACAATACCTT CCATAAGATG GAAGAGTTGC AGGATGAAGT CGAAATTTTA 6060

TTGGATTTTT TCGCTGAAGA CGAGTCAGTG CATGATGAAC TGGTAGCGCA GTTAGCCGAA 6120

CTTGATAAGA TAATGACCAG CTACGAGATG ACTCTACTCT TGTCAGAACC TTATGACCAC 6180

AACAATGCCA TCTTGGAAAT CCATCCAGGT TCTGGTGGTA CTGAGGCGCA GGACTGGGGT 6240

GATATGTTGC TTCGTATGTA TACTCGTTAT GGTAATGCTA AAGGCTTTAA AGTGGAAGTG 6300 

TTGGATTACC AAGCAGGTGA TGAGGCTGGT ATTAAGTCGG TAACTTTATC ATTTGAAGGG 6360

CCTAATGCCT ATGGTCTCCT CAAGTCAGAA ATGGGTGTTC ACCGCTTAGT GCGAATCTCA 6420 

CCATTTGACT CTGCCAAACG TCGCCATACC TCTTTCACAT CTGTAGAAGT GATGCCAGAA 6480

TTGGATGATA CTATTGAAGT GGAAATCCGT GAAGATGATA TCAAGATGGA TACCTTCCGT 6540

TCAGGTGGTG CCGGTGGACA AAACGTCAAT AAGGTTTCAA CAGGTGTACG TTTAACCCAC 6600

ATTCCAACTG GAATTGTTGT CCAATCAACA GTAGATCGTA CCCAGTATGG AAATAGAGAT 6660 

CGTGCCATGA AGATGTTGCA GGCTAAGCTC TATCAAATGG AGCAAGATAA GAAGGCTGCG 6720 

GAGGTAGATT CTCTCAAAGG TGAGAAAAAG GAGATCACTT GGGGAAGCCA AATCCGTTCT 6780 

TATGTCTTCA CGCCTTATAC TATGGTAAAA GATCACCGAA CTAGCTTTGA GGTTGCTCAG 6840 

GTAGATAAGG TTATGGATGG GGACCTAGAT GGTTTTATCG ATGCTTATCT CAAGTGGCGA 6900 

ATTAGCTAAG ATAGAAAGGA ACTCACATGT CAATTATTGA AATGAGAGAT GTCGTTAAAA 6960

AATACGACAA CGGAACAACT GCTCTACGCG GTGTTTCGGT TAGCGTTCAA CCGGGGGAAT 7020 

TTGCTTACAT CGTAGGACCT TCAGGAGCAG GGAAGTCAAC TTTTATTCGT TCTCTGTATC 7080

GTGAAGTAAA AATCGATAAA GGAAGCCTAT CAGTTGCTGG TTTTAATCTG GTTAAGATCA 7140 

AAAAGAAAGA TGTCCCGCTT CTACGTCGTA GTGTTGGGGT TGTCTTCCAG GATTATAAAT 7200

TGTTACCAAA GAAAACTGTC TATGAAAATA TTGCTTACGC TATGGAAGTA ATCGGGGAAA 7260

ATCGCCGTAA TATCAAAAGA CGAGTGATGG AAGTTTTGGA CTTGGTTGGA TTGAAGCATA 7320

AGGTTCGTTC TTTCCCAAAT GAACTCTCAG GTGGGGAGCA ACAGCGGATT GCGATTGCGC 7380

GTGCAATTGT AAATAATCCC AAAGTATTGA TAGCTGATGA GCCAACAGGA AATCTGGATC 7440 

CGGATAATTC ATGGGAAATT ATGAATCTCT TGGAACGGAT TAACyTACAA GGAACAACTA 7500

TTTTGATGGC GACTCATAAT AGCCAGATTG TAAATACCTT GCGCCACCGT GTCATTGCCA 7560 

TTGAAAATGG CCGTGTCGTT CGTGACGAAT CAAAAGGAGA GTATGGATAC GATGATTAGT 7620

AGATTTTTTC GCCATTTATT TGAAGCCTTA AAAAGTTTGA AACGAAATGG TTGGATGACA 7680

GTAGCTGCTG TCAGTTCAGT CATGATTACT TTGACCTTGG TGGCAATATT TGCATCTGTT 7740 

ATTTTCAATA CAGCGAAACT AGCTACAGAT ATTGAAAATA ATGTCCGTGT AGTAGTTTAT 7800 

ATCCGAAAGG ATGTGGAAGA TAATAGTCAG ACAATTGAAA AAGAAGGTCA AACTGTTACA 7860 

AATAATGACT ACCACAAGGT ATATGATTCT TTGAAGAACA TGTCTACGGT TAAAAGTGTT 7920

ACCTTTTCAA GTAAAGAAGA ACAATATGAA AAATTAACCG AGATAATGGG AGATAACTGG 7980 

AAAATCTTTG AAGGAGATGC CAATCCTCTC TATGATGCCT ATATTGTAGA GGCAAACACT 8040 

CCAAATGATG TAAAAACTAT AGCCGAAGAT GCTAAAAAAA TTGAAGGTGT CTCTGAGGTT 8100 

CAAGATGGCG GTGCCAATAC AGAAAGACTC TTCAAGTTAG CTTCATTTAT CCGTGTTTGG 8160 

GGACTAGGGA TTGCTGCTTT GTTAATTTTT ATCGCAGTTT TCTTGATTTC AAATACCATT 8220

CGTATTACCA TTATTTCCCG CAGTCGCGAA ATTCAAATCA TGCGCTTGGT CGGAGCTAAA 8280 

AACAGTTATA TCCGTGGACC GTTCTTGTTA GAAGGAGCCT TTATCGGTTT ATTGGGAGCT 8340

ATCGCACCAT CTGTTTTGGT CTTTATTGTT TATCAAATTG TTTACCAATC TGTCAACAAA 8400

TCGTTGGTAG GGCAAAATCT ATCCATGATT AGTCCAGATT TATTTAGTCC GTTGATGATT 8460 

GCCCTACTAT TTGTGATTGG GGTTTTCATT GGTTCATTGG GATCAGGAAT ATCCATGCGC 8520 

CGATTCTTGA AGATTTAGGT AAAATAGCTG CTTTTATGAG GAGATTGTAA AATCTCCTTT 8580

TTTGCTACAA GAGTTTTTGA AAAGAGATGC GCAGAAGAAA AGAGCTTCCA AAGAAGTCCC 8640 

CCAGAGAAGA CTTC 8654
(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 
TGTCGCGTCA AAATCATTAC TATGGCTATG TATAGCCCTT ACTATGACTT GGCTAAACAC 60 
GTTCGCTTTC AAATTTCTAG GCTCAGGCTG AAACAGTCTC CCAGGCTGTT CACTCCCGAA 120

TGCTAAAATC GTTCTTGATC GCTTTCACAT TGTACAACAT CTTAGCCGTG CTATGAGTCG 180 
TGTGCATGTC CAAATCATGA ATCAGTTTCA TCGAAAATCC CATGAATACA A 
GCGCTACTGG AAACTCATTC AACAGGATAG CCGTAAACTG AGTGATAAGC C 
CCCTACTTTT CGCATGCACT TAACAAATAA AGAAATTCTT GACAAGATTT TAAGCTATTC 360
AGAAGACTTG AAACACCACT ATCAGATCTA TCAACTCTTA CTTTTTCACT TTCAGAACAA 420
AGACCCTGAG AAATTTTTCG GACTCATTGA GGACAATCTG AAGCAGGTTC ATCCTCTTTT 480
TCAGACTGTC TTTAAAACCT TTCTCAAAGA TAAAGAAAAG ATTATCAACG CCCTTCAACT 540
ACACTATTCT AATGCCAAAC TGGAAGCGAC CAATAATCTC ATCAAACTTA TCAAGCGCAA 600
TGCCTTTGGT TTTCGAAACT TTGAAAACTT CAAAAAACGG ATTTTTATCG CTTTGAACAT 660
CAAAAAAGAA AGGACGAAAT TTGTCCTTTC TCGAGCTTAG CTGACTTCAA CCCACTACAG 720
TTGACAAAGA GCCTAATTTC CATAAAAATT GACATGGAAA TTATAAAACC ATTACTAGTT 780
TAGTCCTTTT TGATAACGTG CCAATTCGGC TTGGTTCGCC CAAACATAGT GACCTGGACG 840
GATTTCTACC ATAGATGGCT TATCAGTCTC ATAGTCGTGT TGACTTGGAT CGTAAACCTT 900
CAAGACCTTC TTACGTTCCA AGATTGGATC TGGGATTGGT ACCGCTGAAA GCAAGGCTTG 960
AGTATATGGG TGAATTGGAT TGTTAAACAA TTCTTCTGTT TCTGCAACCT CTACAATAAC 1020
ACCCTTGTAA ATAACTGCGA TACGATCTGA AATAAAGCGA ACAACCGACA AGTCATGGGC 1080

GATGAAGAGA TAGGTCAGGC CGAGCTCTTT TTGGAATTTT TTGAGCAAGT TCAAGACTTG 1140 

GGCACGTACA GAAACGTCCA AGGCTGAAAT TGGCTCATCT GCAATAACAA AGTCTGGTTG 1200 

CATGACCAAG GCACGGGCAA TACCGATACG TTGACGTTGA CCGCCTGAGA ATTCATGAGG 1260 

GTAACGAGTC AAGTGCTCAG CAAGAAGACC TACTTCACGG ATAATATTTT GAACTTTCTC 1320 

TTTACGTTCT TCTTCATCCT TAAATAAACG GTGATTGTAA AGACCTTCAG AAATAATATA 1380 

ATCAACAGTC GCACGTTCAT TCAAACTTGC GGCAGGGTCT TGGAAAATCA TCTGGATTCG 1440 

ACGAATCAAT TCCGCAGCTT GTTCACGCGA TTTCTTACCA TTAATCTTTT GACCATCAAA 1500 

AATGATATCT CCATTACTTG TATCATTTAG ACCGATGATA GCACGACCAA TAGTTGTTTT 1560

CCCACTACCG GACTCACCTA CAAGCGAGAA AGTTTCTCCC TTGTTGATAA AGAAGTTAGC 1620 

ATTTTTAACC GCGACAAACT TCTTACTTCC TTCACCGAAG GAAATTTCTA AATCTTTGAT 1680 

TTCTACTAAT TTTTCAGACA TTTCCTTCCT CCTAGTCAGC CAGATGGGCA AATCCCATTT 1740 

TTTCACGGAT CTTATCATGG AGATTTGCAA TCACAGCTGG TTTTTCTACT TTCGGAGCAT 1800 

CCTCATGAAG AAGCCAAGTT TTAGCCCAAT GTGTCTCTGA TACTGAGAAT TGAGGAGCTT 1860

TTTGTTCGAA GTCAATCTGC ATTGCGTAGT CAGAACGCAA GGCAAAAGCA TCCCCTTTCA 1920 

GGTCAGTATA AAGTGACGGA GGTGTTCCTG GGATTGAGTA AAGATCCCCT TTATCATCAG 1980 

CAAGCTGAGG CAAGCTAGAC AAGAGACTCC ATGTATATGG ATGGCGAGGG TCATAGAAGA 2040

CTTCCTCAAC CGTTCCATAC TCAACGATTT CTCCTGCATA CATAACCGCT ACCTTATCCG 2100 

CAATACTTGC CACCACACCA AGGTCGTGGG TAATAAAGAT TGTTGTGAAA TGATACTCGT 2160

TTTGTAAAGA TTTTAGCAAA TCAATAATCT GAGCTTGAAT AGTTACATCC AAGGCAGTTG 2220 

TTGGCTCATC ACAGATCAAG ACATCAGGTC GGCAGGCAAG GGCAATAGCA ATAACGATAC 2280

GTTGACGCAT TCCTCCAGAA TATTGGAATG GGTATTCATT AAAACGTCTA TCTGCGTCTG 2340

GAATGCCAAC CTTATTCATG TAGTCAATGG CCAATTCTTT CGCTTCTTTA GCTGTTTTTC 2400

CTTGGTGTTT TACAATAACT TCTGTAATCT GACTACCAAT TGTTTTAATG GGGTCCAAAC 2460

TAGTCATTGG GTCCTGGAAG ATAGTCGCAA TCTTAGCACC ACGAATTTGT TCCCAATCCT 2520

TGTGAGAAGA TAAAGCTGTC AAGTCCTGAC CACGGTAGTC AATACTACCT TGGGCAATAC 2580

GACCATTTTC TTCGAGCATA CCTGTGAAGG TCTTTGTCAA AACAGATTTA CCTGATCCTG 2640

ACTCACCTAC CAAGGCTAAT ACTTCTCCTT CGACTAGTTC AAGGGAAACG CCGCGAATGG 2700

CTGTCAATAC TTTGTCACGA ACGTCAAATT CCACGACAAT ATCGCGAGCA GTCAAAATTA 2760

CATTTTTTTC TTTTGTCATT TCTACTCCTA TCTATGTGTA CGTGGATCAC TAGCATCCGC 2820

TAAGTTTTGA CCAACTACGA AAAGGGACAA GGATACCAAG ACAAGGGTTG TCAATGGAAT 2880

CCAGAACAAG TAAGCATTGG TTGTTACGTT TTGTGAATAA TCCGAAATCA AACGACCCAA 2940

ACTTGGCACT GTAATCGGTA ATCCAAGACC GAAGAAAGAC AAGAAGGCTT CGTATGAGAT 3000

AAAGCTTGGA AGCATTTGAG TCATGGTTGT CACAATAACA GATACCAATT GAGGCATGAT 3060 

ATTTTTGGCA ACAATCTTCA AGGTTGGTGT TCCCAAAGTA CGTGACGCCA AGTTGTATTC 3120 

CAAGTCACGA TAGCGCAAGA TTTGCACACG GATCATGAAG GCAATACCAA TCCATGTTGT 3180 

TACGCTCATG GCAAAAATCA GATTCCAGAA TCCAGCTCCG ATTGAGTAAG TCAAGACAAT 3240 

AACAATCAAA AGAGGTGGGA TGTTTGAGAT GACGTTGTAA ACTTCCATCA TGACACGGTC 3300

AACTGATTTT GAAATACCCC AAATACCACC GACAAAAACA CCGATAACCA AGTTAATCAC 3360

TGTCGCAATC ACAGAAATGA GGATGGAGTT ACGAGCTCCG AACCAGACAC CGTCAAAGAG 3420

CGATTTACCG TTACTGTCAG TACCGAACCA ATGCTCCGCA TTTGGCTTGA TATAACGAAC 3480

ACTAAAGTCG TTTACCTTGC TGACATCATT GAAATCAAAC TTAGAAAACA TTGGGTAGAT 3540 

GAAACTTATC AAAATGATGG CTACCAAGAT TCCCAACATG ACTACAGTTG ATTTTTTCTT 3600 

CATAAATTGT TTAAACACTG ATTTCCAGTA AGAATATGCT GGCGCATCAA TAGTTTCAGA 3660

GGCAAAATCG TCACGTTTTA CAAACTGAAA TTTTTCTTTA TCGATTGTAG ACATTATTTG 3720 

CCTCCTTTCT CAGTCAATTT AATACGTGGG TCAATAATAG TCATCCAAAT ATCTCCCAAA 3780

AGACGTGAGA AGATAGAAAT ACATGTAAAG ATGAAGACAA GACCAACGAC CATAGAGTTA 3840 

TTAGATGCTT TTACAGAGTC AATCAACATT TTACCCATAC CTGGGAAGGC GAAGACTGTT 3900 

TCAGTAAGGG TTGCACCACC GATAACCCCA ATAATGGCAG CAGGAATTCC TGAAACCAGC 3960

GGAACCATGG CATTTTTAAA GATGTGTTTG TTTGAAATTT CTTTTTCAGA CAAACCTTTT 4020

GCACGAGCGA AACGAACAAA GTCTTGAGAT TGCAAGTCAA TCATGTAACG ACGAATCCAA 4080

ATGGCTGTAC CAGGAGCACC CAACAAACCA AGGATGACTG CTGGTAAAAC GTAAGAACGC 4140 

CAATCTCCAG CTCCCAAGAT AGGGAATGAA TCTGGAAGGG CAATAGATGA TCCAATCAAT 4200

CGAACGATGT AAACCAAGGC AATCGTTGGA AGAGCAAGCA AGAAGGTCAA AGCCCCTGTT 4260

GAGAGGCTAT CAATCCAAGT GTTCTTGAAA CGAGCCATGG CTGAACCAAG TGGCACGGCA 4320

AGAGCATAGG CAAGAACCAA ACCAATCAAA CCAGTAATAG CAGAGCTGAC AATCATAGAT 4380

GGATATTGGT AATTACTTTC AGTCGCTGTA TAAGGATCAT CTTTCCCATA GCTAGCTACT 4440 

TCACGAGAGT CAGCCTGACT AGGTGACTTG TAGGTTCTTG AGTAAATATT TACAGAAGAC 4500 

GTTTTCTTAC CTGTTGGGAA CTGAACTTGG GCAGTTTTGG TTTGTCCTTG ACCTTGAGTA 4560 




ATAACCTGAA GAACTGGTGT ATTAGCATAG GTTGGGTAAG AGTCACCTAA ATTCAAGTTC 4620

ACAAAGTTTT GATGAACAAA TGGGAACTGA CTGTTAAAGT ACAAGAGATA TTTATGTTTA 4680

GTTCCTGAAC CGACCAATGA CCATCCGATA GCTGGATCAT TTTCAAAACG AAGGTAGCGT 4740 

TTCAAGTCTG GATTTTCAGG GTCTTGGATT TTATTTGTAT GGTCAATGTC AATCAAGTTA 4800 

GCATAGAAGT GAAAAACACG TTCAAAAATT GGAATTTCAC GAGTAGCATA GAATTGACCA 4860 

CTTTCAGTAA ATTCTCCCAA AGTCCAACCA TGACCTAATT GATTGATGTA CTTTTCATAA 4920

ATAGCTTTAT TGGTCGCATT TGCTTCTACT GTTACAGAAG AATCCATGCT ACTTGCCTTT 4980 

TCTTGCAACT CTTTAGTATC GTAATACTCA ATGTAGCCCA TACGCTCAAA CACAGTATTT 5040 

TCATAGTTAT CACGTTTATC AGCCGTTGTC GCAATTTTAT TATAGTTAGG ATCCTGCTTG 5100 

AAAATCAATT TTCGAGGAAC CAAGGTATAG ATAATCGTGT AGGTCAAAGT CGTTACTAAG 5160 

AAAATCGAAA CCAATGACCG CAAAACACGC ATAAAAATAT ATTTTTTCAT ATTATTTCCT 5220 

TTAAAAATCC CAAAAGAACC TTCTCCTCAT GGAGAGAAAG TTCTATTAGA AATTATTTAC 5280

TTCACATGAC TTGCCAATTC TTTTTGAGCT TTCTCATTTG ATTCAGCTTT TTCTTTCAAC 5340 

CATTTTTCAC GAGCTTTTTC ATACTCTTCC TTAGTCACCA CTTTATCTTG TGATTTCAAA 5400 

TATTTGAAGT AAACATCTGA CCCCTTAGAG CCTGTTTGCG CAGAAGCTCC AGTAAATGGA 5460 

ACAATTCGTG AAAGCACTGG TGCTGCACCA GAAGAAGCCA TAGCAGGAAT AAAGAGTGAA 5520 

CTATCTGTCA ACCATGCTTG AGCCGCTGCA TATTTTTCAT AACGGACATT CAAGTCGCTT 5580 

GTCTCTCTGG CAGCTTCATC AACTAATTTA TCGTATTCTT TCAAACCAAC TTGAACTACT 5640 

GAAGGGCTAT TTGGATTATC AAATCCTAAA TATGTTTTTG TAGTTTCACT GCTAGTTGTT 5700 

TTTAAAATAT CCAGGTAAGT AGATGGGTCT TGATAGTCTG GCCCCCATGA AACTCCTCCT 5760 

GATACATCCC AATCCTCAGA TGAAGCATTG GCAGCATAGT AAGTAATATT AAGGAATTCA 5820 

TCACTTGTCA TTTGTTGAAT ATCAACAACG ACATTTTCAA CACCAAGAAC TGTTTCTACA 5880

GATTGTTTAA AGGACTGAAT ACGAGATATG TAGTTTTTTG ATGCTTGGTC TACTGGAACG 5940 

TCCAGATGAA TAGGAAACTG AACGCCGTCT GCTTCTAAAG CTTTCTTAGC TTTCGCAAAC 6000 

TCTGCCTTGG CCTTGTCAGC ATTGAATAAA CCATCCTGCC CATCAGCTAA ATTCACACCT 6060 

TTCCACTCAT CACCATAAGC AGGAAGTTGA GCAGCGACTA AATCACCAAA GGTCTTCTCA 6120 

CCAGCTGAAA CAAAGTCTGG TTTTACAAAT AAATTACGAA CTGCTAAAGC TGCTCCATCT 6180 

TTACCATTGA TTTGAGCTGA GTAAGCTGAG CGATCAAGAG CAAAATTCAA GGCTTGACGG 6240 

AAATCTTTGT TAAGCAATGC CTTCTTAGTA GCTACTTTCT CTGAATCTGT AGTTTTAGAA 6300 

GTATAGTTGT AACTTTGGCG ATCAATATTC ACACCCAGAC CAGCAATCCC AGAGCCTGAT 6360

TGTGTGTAAT AGATATTGTC CTTGTATTCT TCTGCAACCT TAGAATAGTT GGAGCTGGTA 6420

GGGTAAAGAC GGGCATAACT ATAAGCTCCA CTAGTGAAGT TACGCTCTAG CGACTCCTGA 6480 

TCTGATCCAT CATAGTAAGC TAGATTGATA GTATCTAGGT GGACATTTTC TTTATCCCAA 6540 

TATTGCTCAT TTTTTACAAA CTCTACAGAA GATTTTGCAG TCAACCCTTT CAACAAGAAT 6600 

GGACCATTAT AAAGCAAGGA TGTCGGATCT GTTGGTTTAG CAAAATCGCT TCCTTTTGAT 6660 

GTTTCGAATT CTTCATTCAG AGGCCAGAAA ATAGAATAGG TCAACTTAGA GTTCCAGAAC 6720

GGTTCAGGCT GGTTCAAAGT GTATTGTAAC GTATAATCAT CAACCGCCTT GACACCAACT 6780

GTTGAAAAAT CTGTTGAAGT TCCTGATAGA TAATCTGCCA AGCCTTTAAC CGAATTTTCA 6840 

GCTAAATACA TAGCTTCTGA TTTTTTATCT GCTGCGTGTT TTAAACCGTT CACGAAATCT 6900 

TTAGCCGTCA CCTCTGCATA TTCTTCTCCA TCAGAGGTAA ACCATTTAAC CCCTTTACGA 6960 

ATCTTATAAG TGTAGGTCAA ACCATCCTTA GAGACTTCCC AATCCTCTGC AACTGCAGGA 7020

GCAAGATTAC CGTAATTATC GTTAGTGAAT AAACCATCAA TCCCATTTGA AGTCACTACT 7080

GTTGTACTAT TTTTACTTGA AATCAGGTAG TCCAAGGTTT CTGGGTCTGC TGTATAAACA 7140 

TAGCCATAAG CTTTAGGGGC TGATGAATCA GATGATTTTG AAGAACTGCA TGCTGCAAGT 7200

ACACCTGCTG CTAATAAAAC AAGACCTGCT GTAGCAAATA CACGATTTTT TTTCATTTTC 7260

TACTCCTCTG TTTATGTGAA TTATAGATTG ACAACCATTA TATCACATTA TCCATTAAAA 7320

ATCAAACAAA TTTTCAGAAT ATTTAGGCTT GTTGGCACAA ATTTTTCATT TTTTTTGAAT 7380 

ATATGATTCA AATTGTCGTT CGAAGTGTCA AAGACTACAG TGAAAATAGG AAATTTGACG 7440 

CAGAAACTTT GGAGTTTAGG AAGACATACA GTAAAATGAA ATACGGACGG AACAATGTGA 7500 

TTTTGGAATT CAAATTAAAT TATAACAATA TTGTAGAAGT ATCATTCTAG TATTCAAGAT 7560 

TCAGTTTACT ATGTCTTTTC ACACCAACCT TATCCCGAAT TCAATTACTT TTGTGATTTA 7620

CATATATAGA TTAAGACTAT CTTTTATACT TTAAAATTTC TCGCTACCTT ATCCACTATA 7680 

TGCTCCTCGC TATCACGTTT CTATTCATAG CCTACGATTT CACTATTGCT TTCTCTGACA 7740 

ATTCTTATTT CCTGCGTCAG ACTTAAAACG ATCTATCCCC AGACCATTTT AATCCGCTAC 7800 

CTCACGATAG TCAGGCTTGG GGAGCGCTAT TGTATTCACC GGTAGTGGAG CCCTACAGAG 7860 

GACTTACACC TCAGATGCAC GACATGCCCA TCGTATAAAA AATCTCCTAC CCAAGGTAGA 7920

AGATTTCAAA CTTATAAAAC TTAATCCGTC ATGTCCGATA CCAACATTCG ATGCTCCAAT 7980 

GGAATACTGC ACATAACTAG CAAGAAAATA AAGCCTGACT GAATCCAGAA GAGAGCCAAG 8040

TCAAAAATTC CGTGCACAGC AACCACTGTA AGGAAAGATA GATAAAGGCC GATAATCGGA 8100

CGTTTCCCCG ACTCCTGACT CATATCCATC ATCAAGCGAA CAGGAGCAAC AGAAGACAAA 8160

ACTAATAAAA TAGTCCCCAC AATTCCGTAA CTCAGAATCG TATCAATATA AAGACTGTGG 8220

GCATGTTCAT GATAAGGAGC ATGTATCCGA GGATAAGAGT TCATATAGGT CAATGGCCCT 8280 

TCACCCCAAA AAGGATTTTG CTTAAACAAG GCCATCCCAG CATCCCAGAT AGAAATGCGT 8340

TCTTCCATAG AAGAGTCTAA AGTACCCATT CGAACTCCCA AATCACTAGA AAAGAGGAAA 8400 

CTCAAACCAA TCGCGAAGAC CCCAATACTA AGCCAAAAGG CCTTCCAGTT TTTAATAGTC 8460 

GTAAAGAGAT AGATAATTGC TCCAGCGATA ATAGCAGGAA AGGCAGTTCG ATTTTGAGTA 8520

AAGTTCAAAC CAAAGAGATT AACAAAGCCT GCAATCACAC AGAATACTTT CAACCAATTC 8580 

AACTTGGTCG TTGTAAACAG ATAGAAAGCA ATCATAATAC AGAAACAACA AATAATTCCA 8640

TAATAATTAG GATTAAAGAA GGTCACTTCT GCCCGGTTCT GATGCCACAC CTGCATATTG 8700 

GGTGAAAGAA AAGCATAGTT AAATTTCTTC ACAATTTGGA AATGTTCTAA ACTGGCAAAA 8760

GCAGCTGACA AGACACTACC AAACAAGACA AACTGCAAAA TCAATCGAAA GAATTTATGG 8820

GATAAAATCG ACTGATAGTG CAAAAAGAAA ATAGTAAATA GAAACATTCC TACTGAAGCC 8880

ACAAGACCCA TCCAATTTTG TGCAAGAATG GATATAACAG TACTATAGCT AAGAAAAAGA 8940

AGCAGCATCG GATGCTCCCC CATTTTCTGA AGAATACTTT TCATGTCTCC TGTAAAAATC 9000 

AAACTGATAA TATATAAACA GAGTACAACT ACAAAAAGAT AAAAGGGTAA AAAGATACTC 9060 

AGGATAATTC CCAATAAAAT CAGCTCTTTA CTAGACAACC CCTTCAGCTT TTCAATAAAG 9120 

CCTATTGATT TCAAAATGAA TCCTTTCTCT CCAAATCAGC TGATTCAGAT AATAGTAAGC 9180 

TATCCTATAT TGTACCACTT TTTTAGCAAT TTGAAAACAA AGGAAACGTT TTCCAAAATA 9240

AAAACCCTAT TTTATCCACC ATATCAAGGC TTCAAAATGA TACTTCAACT CCATTCTCAA 9300 

TTACCCGATA AGTCTGATTT TGCAAATCAA TTTCTACTAC TGCTGTTACG GACTTATCTT 9360 

TATTTTGACG TTTGATTACA ATGCTGTGAG CTGTTGGTGT CTCTATCTCA GTAGTCCCTT 9420

CTAGATCAAA GGCTTCTGAA CGGTTACGGA AAGAAAATAG ATTGAGAAGG GCCTTCACAA 9480

CAGGTCGTTG CACTTCTTTT GCTATTTCCT CGTTGCTATA GTAATGACGA TTAATATTTC 9540

GACCTTCTTT AGTTTCTTCT AATAATTTCA AGTCATTCTT GCCTGCTAAT AGACCCACAT 9600 

AGTAAATCTG AGGAATACCT GGGGCAAAAG CTTGAATTAG ACGAGCGAGA AAATACTTGA 9660 

CATCATCATC TCCAAGCGCT GAATAGTAGG TTGAATTGAT TTGGTAGATA TCTAAGTTGT 9720 

TATACTCGGC ACTAGAGTAC TTACGTTTGA CATTGGCTCC AACCTTATAG AGTTCATTTG 9780 

AAGCATAGTC AATCTCCTCA TCGGTCAGGA TATCCTTGAC ATCTACTACT CCAATCCCAT 9840

CATGGGTATC TAGCGTCGTA AATTGCTTCA TCGGGCTCAT CTTTAACCAC TTAGCCAAAC 9900

GCTCTGTTCT GGAACTGTAA AGAGTATAAA GTGTCACCAT TGGAAGAGCA AAATCATAAA 9960

CATAGTAATC ATGGTCTGCT ATTTTAAACT GAATCGAATA GTGTTCATGA ATCTCAGGTA 10020 

AAAGCTCTGT CCCATACTCA GCAGCGATAT CTCGAACTTT GTCCAATAAA TCCCAAATAT 10080 

CTGGTTCCAC AAAGAAATCA TTAGTATCCA ATTTCTTCAC TGCATAAGCA AAGGCATCTA 10140 

GACGAATCAA ATCACACCCA TTACTTGCCA AGTGCTGAAT GGTCTTACGG ATAAATTCCA 10200 

TAGTTACTTC TTTGGTCACA TCAAGATCAA TCTGCTCCTC ACCAAAGGTA TTCCACAAAT 10260 

GTTCCACTGA ACCATCTTCA AACACAATCT CTTGCTTTGG TGCACGATCC TTACGCTTGT 10320 

AAATTAAATC TACATCAGAC TGTGTCGGAC GGTTTTCTGG CCAAAACTTA TCCCAGTTTA 10380

AAAAGAGAGC TTTAAATTCA CTGGCTTCAT GTTTTTCTTG ATAGTCCTTA TAATACTTGG 10440 

ATTGACGAGA AATATGATTA ATCATAAAAT CAAACATAAG ATAATATTTC TCACCTAAAC 10500 

GCTTCACATC CTCCCAATCA CCAAAAGCTG AGTCCACTTC GTCGTAGTCA ACTGGCGCAA 10560

ATCCACGATC AACTGTTGAT GGGAAAAATG GTAAAAGGTG AACTCCTCCA ATAGCATCTC 10620 

CAAAATGCTC TTCCAAATTA TCATATAAGT CTTTAAGATT ATTTCCAAGG CTATCAGAAT 10680 

AGGTAATCAA CATGGTTTTA TTTTGAATTG GCATCATTAC TCTCCTTTTT CTAATTGAAG 10740

CCAAGTCTCA TATGATCTGG CTTCATAAAT AAAATTCATT TTAAATCTCT ATTTATCATC 10800 

AAACTCGTAC TAATATAGAC TGTGATAAAC AAAGTACTAC TTTCTTGTTT TCTGCATAGA 10860

ATTATCAACA AGCTAAACTC TTCCTCTGTG TCAAAGACTA TAGATTCCAT GAGCTCTTCT 10920 

TATACTCTTC GAAAATCTCT TCAAACCACG TCAGCTTCAC CTTGCCGTAG GTATGGTTAC 10980 

TGACTTCGTC AGTTTCATCC ACAACCTCAA AACAGTGTTT TGAGCAACCT GCGGCTAGCT 11040 

TCCTAGTTTG CTCTTTGATT TTCATTGAGT ATTACTTCAC TGCCCCGTTG CTCATTCCTG 11100 

AAATGATATG GCGTTGGAAG AAGAGATAGA CAATGGTGAT ACTGATAATG CCGACCACGT 11160 

AAGAGGCAAA GCTTGGTCCG TAGTCGTTGA AATATTGGCC TGCGTAGTTG TATTGGAACA 11220

AAGGCAGAGT CCACATTTTG GAATCCCGGT TCAAGACAAG GAGTGGCAAC ATGAAGTCAT 11280

TCCAGAACCA AAGGGCATTG ATGATCATGG TTGTCGCATG CATCGGTTTC ATCATTGGGA 11340 

AGATGATGCG GAAATAGGTT GTAAATTGAT TAGCCCCATC GATCTCTGCT GCTTCATCCA 11400

GACTTTCTGG AATCGAGATT TTGATATAGC CAACATAGAG AAAGAGGGTC TGTGGAATCG 11460 

CATAGGTCAA GTAGAGCAAG ATCAAACCAA AGGTATTAGC CAAACCGAGT TTACTCATCA 11520 

TAACCGTAAT CGGAATCATG ATGACTTGGA AAGGTACGAA GATTCCGAGG ATTAAGAGGG 11580 

TATACATGAT GGTAAAGGCT TTTCTTTTAC TCATATTGCG AGCGATGGAG TAGGCTGCCA 11640

TAGGGATAAA GATCATTACT GCAAGTAAAG ACAAGACAGT GATGACGACA GAGTTCCAAT 11700

AATAGCCTCC AATCCCATCA GCTAAGAGAC GGCTAAAGTT GTCCCATGTG AAGTTGGTTG 11760

GAAAGCCAAA GAAATTATCT ACAATATCCT TAGTGGGTTT GAAGGAACTA AAGAGGGTAG 11820

CAAGGAGCGG CACTAAAATC AGAACCGATC CTAGAATCAA TAGAATGTAT TTGCCAATCA 11880 

GGGCTTTTCT TTCATCTTGT TTCATCATGC TTCTCCTCTT AAATTTCAAA TTTCTTAGAT 11940 

ACTCTCAATT GGATGATCGA AATCACTACA ATTAAGAAGA ACAAGATTAC GGCAATGGCA 12000 

TTGGCATAAC CGAATTGGTT GTTTTTAAAG GCATAGTTAT AAACCAAGAG CCCAAGTGAG 12060 

GTTGTGGCAT TGTTTGGACC ACCACCGGTC ATGGCAAAGA CTTGGTCAAA GGCAGTCAGC 12120 

CCACCTTTTA GGGCTAGGAT AAAGACCATA GAGACACTTG GTAGCAAGTA AGGCAATTCA 12180 

ATGTTCCAGA AAACTTGCTT GCTAGTCGCA CCATCAATCC TTGCTGCCTC TGTAATCTCA 12240 

GTTGGAATAG ATTGCAAACC AGCTAGGAAG ATGATGATGG GCATAGCCAC CCCTTGCCAA 12300 

AGAAGGACAA AGACAGCCGC AAAGATTGCT CCCCACTTAG TCCCTAAAAG ACTGGTTTGG 12360 

AAAAATTCAA TATGAAGGGC ATTTCCAATC GCTGGAAGAC CGTAGTTGAA GACTTGCTTG 12420 

AAGATCAAAG CCACTGTCAA ACCAGATAAA ACAGCTGGGA AGAAGAACCA AGCACGGAAG 12480

AAGGTTTGGC CTTTGATTTT AGAATTCAAG ACACGCGCAA TGAAGATCCC GAGTGCAATC 12540

TCACCAACCA CCATGGCAAT CGCAATGATT GCGGTAAAGC CAATCGCATT CATGAATTTT 12600

GGATCCATGA AGAGGAGCTT AAAGTTGTTT AAGCCAACAA ATTTGTAGTT ATAAGTCAAT 12660 

CCTGTCCAGT TGGTAAAACT GTAAAAGGCT CCTTGAAACA TCGGCACATA GAAGAAAATT 12720 

GCTTGTAACA AGAGGGGGAT GACCACAAAA GCCCATGCCC AATATTTTTG TAATACTTTT 12780

TTCATAGTCT CTCTACTCCT AATCCACATC CGCTTTCATC GGGTTAAAGA AGGCATTCAA 12840 

ATCATTGACC ATGCCTTGTT TATCACCGGT CAAGACATAG TTCATGGTCA AGGTATGGAA 12900

GTCTGCTTCA CTGGTCCAGT ATTGTTGCAA CCAGACCAAG TGACGATCCG TAAAGGCATA 12960

TTCGGTCATA CCAGCAAGCG GTGAATCTTC TCCTGCTTGT TTGACCCCTT CGATCGCTGT 13020 

TGGAGATCCG TCCACATCGT AGTATTTTTG CATGACTTCT GGACGGGTCA TATATTCCAC 13080

AAAGGCATTG GCTTCTTTTG GATGTTTGGT GGTGGCTGAG ATAGACCATG CCAAGTCTCC 13140 

CGCACCAACG GTTAAGCTTT GTCCTTTTTC TTTTCCTGGA ATCATGAAGG TCCCAATCTT 13200 

AAAGTTCGGT TTTTGTTCAT TAATCGCTGT GATCGCCCAA GACCCATTTG GTGTCATGAG 13260

GACATCCCCA CGTGCGAAGG CTCCGATAAC ATCGGTATAG CCAGCACCTT CCCAGTTCTT 13320 

TTGCTTAGAT CCATTGATGC GAAGGATGTC CATGACCTTG ATATCATCTT TCATAATCGG 13380 

ATCCGACAAT TTAATGGCAT TTGGTTGAGA ATAACGAAGG TATTGATTTG CTTCTTTTCC 13440

TCCACCTGTT GCTGTCGCAA AGGCTAATTG ATTGTAACCA TTGAGTGTCC AAGCATCTGC 13500

ACCTGCAATT CCAAATGGTG TTTGTCCTTT AGCAACGATA TCTTTGACTA ACTGTTCAAA 13560

TTCATCCCAG GTTTCAGGAA CCTTCAAGCC CAGTTCTTCG AATTTATCTT TGTTGTAGTA 13620

AATTCCATAA GCATTAGCTG TAAAAGGAAC GTTGTAAACT TTTTCGTTTA CAGCATATTT 13680

TTCAGCGTAG CCATTTTTCA CGCGTTTCAG GTAGTCTTTG TTGCTCAAAT CTTCAAAAAC 13740 

ACCTGCTTTT GCCCATTCTT GCAGTTCGAT GGACTGTGGG TAAATATTGA CCACATCAGG 13800

CACATCTCCT GCGAGAACGC GTGTCTTCAA TACTTCACCA GCATTTGGTA CATTGACGAC 13860 

TTTGACCTTG ATCTTAGGGT TTTCCTTCTC AAAATCACGA GTGATTTCTT CCAAGGTTTT 13920

GGTCATTTCT TTTTTCTGGT TGAAATACTC GATGGTCACT GTGCCATCCG CAGATTTACC 13980

ATAGTTGGAG CAAGCGCCGA GCCCAAACAA AGCTAAACCT GTAGTTGCAA GAAGTCCGAT 14040

TTTTTTATAC CATTCCATTA GAAAGCCTCC TTTATAAATT TATACACCCT TATTGAACTG 14100 

CACCCCAAAA GTTAGACAGA ATAAATCTAA CTTTTGGGGT CAGTACATAT CATAGTTTTC 14160 

TAAAAATATA CTGTCTACTC AAAAAATCTC CTTGGGATAA GATAACAGTT AAGCCCGCAT 14220

ACATTAGTTC TGCACCTGAG TAAACTTCGC CATTTTCCTG TAATTTATAT AGTCCCTCTT 14280

CATCCAAATC TTTTAATTTT AAAGTTGTTT CCATGGTCTC TACAACAGAT AAAACGCGAA 14340 

CGTAGGTTAC AATCGTTTGA TTTCCGTAAT TAAATTGTAC AGCTGCTTCA TTGGATACAG 14400 

TATCAGGATT AATTAGTCTA TACTGCTGTC CTAACTGAAC TACTGGTCGT AATTCTTTAT 14460

ACAAGTTCAC CTGATTAGCA ATCGTAGCTT TCTCTTCATC TGATAAATTT GTCAAATCAA 14520 

GTTCATAGCC CAAATTTCCC ATCATTGCTA CAAGGCCACG TGTTTCTAAT GGTGTCATTC 14580

GTCCCATCTG ATGATTCGGT ACTGCTGACA CATGAGCCCC CATAGAAATG GTTGGATAGA 14640

GATAGGATGA ACCGTATTGA ATTGGTAAAC GTGCAATGGC ATCAGTATTA TCACTAGCCC 14700

AGACTTGTGG GAAATAGCGC ATCATACCAA GATCATTTCG TCCACCACCA CCAGAGCAGG 14760

ACTCAAAGAG AATATGGCTG TGCTTCTCTG TCAGATAAGA AACGAGTTCA TAAAGCCCCA 14820

GCATGTACTG ATGAGATTGC ATCTGTGTCT CTAGATAAGT TAATCCATTC CCTAGCTTAG 14880

TGATATTGCG GTTCATATCC CATTTAATGT AATCAATATC ATGATAAAAT AGGAGTTGAT 14940 

CTAAGACACT TTTCAAGTAT TCTACTACCT GAGGATTGGC AAGATTAAGT ACTAATTGAT 15000 

TCCGAGAATA AGTATGCTCA TAGCCAGGAA CCTGAATAGC CCAGTCAGGA TGTTGACGAT 15060 

ACAAATCACT ATCTACAGAA ATCATTTCGG GTTCTAACCA AAGTCCAAAC TGCAAACCTC 15120 

TTTCATGGAT AGCTGAAATC AGACTTTCTA GACTTCCACC CAGTTTTTCC TCATTAACAA 15180 
CCCAATCACC TAAAGCACGA TTATCATCAA AACGATTGCC AAACCAACCA TCATCTAATA 15240 

CAAAAAGTTC AATGCCAACT TTCTTAGCTT CATCTGCTAA CTCTAACAGT TTTTCTCTCT 15300 

GAAAGTCAAA GTAAGTAGCT TCCCAGTTAT TGATTAGAAT TGGACGTTCT TTTTTAGAAA 15360 

ATTCACTTAG CATAATGTGC TTCAGTACAA AATTCTGACT TTCATGACTA ATACCAGTTA 15420 

ATCCCTGATC TGAATGAGTC ACTAAAGCTA CCGGTGTTTC AAAGTATTCC TCAGGAGCTA 15480 

ACTTCCAAGA AAAGTTTTCT GGATTAATGC CAATAGCCAC CCGAACTTCA TTCAATTGAT 15540 

TTTTTTGAAC AAAAGCTTCA AAGTTGCCAC TATACATTAG TTGAATAGCA AACACATTCC 15600 

CAGCATCCTC TGTGACTCCT TGTTCGCATA GTAGAAGAGC TGGTGTTTGA GCATGACCAG 15660 

AAGCACCTCG GTTTGAACTA ATCGAAAAGA TTCCTTGTTC TACCTGTTGA CGTCTAACAG 15720 

TCTTTTCACG AGCATAAGCA CCCTGCAGAG TTACTATTTC GTAATCTGCA GCTGGAAAAT 15780 

CAGCCATAAA AGAAAAATCT TTATGGATGA CAACTTCCTG ATTACTATTA TTATCTAATT 15840 

TACTGTAGCT AGCAATAGTC GCATCATTAT TAAAAGTAGT ATAATACAAA GTCAGACTAA 15900 

GTTGAGCCTT AGAATCTTCT AACATTAAGA CAAGAGTCTC TGTATCGTCC ATGCTATGTG 15960 

GAGAAGGTAA GCCCTGTGGA CCATTCTGAC CTTTTAAAAT CTTTGCTTCT ACAAATCGAA 16020 

AGTCTGTTAC TTCAGTTACA CTATGCTGAA CCTGTATGGT TGGTTTCCTA AAATCTCCTA 16080 

AGCCATGTTG TCCAAAAATC TGTCGCTGAG TATCTAAACT AAAGGTTCGA TTAGTAGCCG 16140 

TTGGATTTCC TGAAAAGGCA TGGTCTCGTT CATAAACACT ATTGGAACCT TTATAGTTCT 16200 

TAATAGTCTT TCCTAAATGT TTCAAAAGTA AGTAGCCATT TCGATTTTCA ATAATCAAAC 16260 

TTAGATTTTT ACTCTCAACA TAAAATAGAT TATTCTCTAT CCTAACTCCC ATTTACTTCA 16320 

CCTCATCACT TTATTGATTA TATTTTATCA CCTGAAATCG CTTTCCAAAA TAGAAAAATG 16380 

TCTCAAGAAT ATGGTAAAAT GTTAGGTAGG AGGTAGCACA TGTTAGTTTT TTCAGAATAC 16440 

CAGACTGGAA CAATCGACCT TGCCCTAAGC TTTTATGGAT ATGAGGAATG CACACCTAAT 16500 

TACTCTTTTG GTCCAGCCAT TCGTGATACA TACGTTCTAC ATTACATTAC TAAAGGACAA 16560 

GGAAAATTTC ATTACAAGGG TAAAATTGTT GATTTAAAAG AAGGAGATTT CTTTCTATTA 16620 

AAACCAGAGG AACTAACCTT TTATCAAGCA GATAGTAAAG AACCTTGGGC CTACTACTGG 16680 

TTAGGAATCA CTGGAGGGAA AGCCCCTGAT TATTTTGCTC TTTCCCAAAT TTCTGATCAA 16740 

TCCTATCTCA TCCAATCTGA AACTTGTCAT ACCCAGACTA CTGCAAAACT CATCTCAGAC 16800 

ATTGTCCGCT TCGCTCAGAT TACAAAATCA AGTGAATTAG CTCAACTCCA TATCATGGGA 16860 

CAACTTCATG AACTGATGTT TCATCTGGGA ACTATTGCTC CCAATCAGAA AAAAAAGAAT 16920 

ATTTCATCAA CCCACCAACT CTATCTTGAA TGCAAACGAT TAATTGATAG CCACTATCCT 16980 
AGCGTATTCA AAGAATTTAA TACCTTATCA CCCAAAGAAT ACCTACTCTA CGTTCGAATG 17100 

CACCGAGCTA GACAACTTCT CGAAAATACC CAAGAGTCCA TCAAGGTAAT TGCATACTCG 17160 

GTAGGTTTTT CAGATCCACT CCATTTTTCG AAAGCTTATA AACAATACTT TAATCAGACT 17220 

CCAAGTCATA CAAGAAAAGA ATACTCTCAA TACCAACTAG TAAGAAAGGC AACATTATGA 17280 

AATCCTACCA AGCTGTCTAC CAAATCCTAT CTAAAGAAAC CGACTATATC AGCGGAGAAA 17340 

AAATCGCAGA AAAACTATCC CTAAGCCGAA CAGCAATTTG GAAAGCCATC AAGCGACTAG 17400 

AACAAGAAGG CATTGAAATT GATAGTATCA AAAATAGAGG ATATAAACTG ATGAATGGTG 17460 

ACCTTATTCT TCCAGAGATT CTAGAAGAAA ATCTTCCAAT TAAAGTCAGC TTTAAACCCG 17520 

AAACAAAATC AACACAACTA GATGCAAAAG AAGCAATTGA TTTAGGCCAT GAAGCAAATA 17580 

CCCTCTATCT AGCTTCCTAT CAAACAGCAG GCCGAGGCCG TTTTCAACGT TCCTTCTACT 17640 

CACCACAAGG TGGTATTTAT ATGACACTCC ATCTTAAACC AAATCTCCCC TATGACAAAT 17700 

TACCATCCTA CACACTACTT GTAGCTGGAG CTGTCTACAA AGCCATTAAG AACCTAACTT 17760 

TAATAGATGT CGACATAAAA TGGGTCAATG ATATCTATCT AAACAATCAT AAAATTGGAG 17820 

GAATCCTTAC TGAAGCAATG ACCTCTGTAG AAACTGGCTT AGTCACAGAT ATCATTATTG 17880 

GAGTAGGTAT CAATTTCACT ATTAAAGACT TCCCTCAGGA ATTAAAAGAA AAAGCTGCCA 17940 

GCTTATTTAA AGCTACAGCT CCTATAACAA GGAATGAATT GATCATAGAA ATCTGGCGTG 18000 

CTTTCTTCGA AACACCAGCA GAAGAGCTAT TATACCTATA CAAAAAACAG TCATTCATTC 18060 

TAGGAAAAGA AGTCACTTTC ACACTAGAGC AAAAAGACTA CAAGGGACTT GCTAAAGACA 18120 

TCTCAGAAAA TGGAAAACTT TTAGTTCAAT GTGATAACGG AAAAGAAATC TGGCTAAATA 18180 

GTGGCGAAAT TTCTCTCAAT AGTTGGAAGT AAAATAACAC AATTATAATA TAAACGATAT 18240 

AAAAATAACT TCAGATTAGT AATTCAATTA AGTTTTACGG ATCTGAAGTT TTATTGGCTC 18300 

TAAAAATAAA AAAGAGAGTT ACAGACTCTC ATTAAAACGG AGAATAAGGG ATTCGAACCC 18360 

TTGCGCCAGT TACCCGACCT AACGATTTAG CAAACCGTCC TCTTCAGCCT CTTGAGTAAT 18420 

TCTCCAATTA ATGGGCACGA GTGGACTCGA ACCACCGACC TCACGCTTAT CAGGCGTGCG 18480 

CTCTAACCAC CTGAGCTACG CGCCCAAGTT AAAAAACTTG GTAATTTGAA CAAAGTTCAA 18540 

AGCGGGTGAC GAGAATCGAA CTCGCGACAA CAGCTTGGAA GGCTGTAGTT TTACCACTAA 18600 

ACTACACCCG CATAAATACT ATCAATAAAA TGGCGCGAGA CGGAATCGAA CCGCCGACAC 18660 

ATGGAGCTTC AATCCATTGC TCTACCAACT GAGCTACCGA GCCTTATTGC GGGAGCAGGA 18720 
TTTGAACCTA CGACCTTCGG GTTATGAGCC CGACGAGCTA CCGAGCTGCT CCATCCCGCG 18780 

TTAATAATAT AAAAGGAGGA TGTGGGATTC GAACCCACGC ACGCTTTTAC ACGCCTGACG 18840 

GTTTTCAAGA CCGTTCCCTT CAGCCGGACT TGGGTAATCC TCCAATATTC AAATGGACCT 18900 

TGTAGGACTT GAACCTACGA CCACTCGGTT ATGAGCCGAG AGCTCTAACC AGCTGAGCTA 18960 

AAGGTCCGAC AAGATCATTA TAGCGGCGAA GGGGATCGAA CCCCCGACCT CCCGGGTATG 19020 

AACCGGACGC TCTAGCCAGC TGAGCTACAC CGCCATGAAT CGGGAAGACA GGATTCGAAC 19080 

CTGCGACACC TTGGTCCCAA ACCAAGTACT CTACCAAGCT GAGCTACTTC CCGAGTTAAA 19140 

TAGAAAAATG CACCCTAGAG GAGTCGAACC TCTAACCGCC TGATTCGTAG TCAGGTACTC 19200 

TATCCAGTTG AGCTAAGGGT GCTCCATATT ATGCCGAGGA CCGGAATCGA ACCGGTACGA 19260 

TCGTTACCAA TCGCAGGATT TTAAGTCCTG TGCGTCTGCC AGTTCCGCCA CCCCGGCCTC 19320 

TCTAAGCGAA CGACGGGATT CGAACCCGCG ACCCCCACCT TGGCAAGGTG GTGTTCTACC 19380 

ACTGAACTAC GTTCGCACTG TTTTCTTCTA TCTAAAAATG CCGGCTACAT GACTTGAACA 19440 

CGCGACCCTC TGATTACAAA TCAGATGCTC TACCAACTGA GCTAAGCCGG CTCATTTGTT 19500 

ATATCTTAAT GCGGGTTAAG GGACTTGAAC CCCCACGCCG TTAAGCGCCA GATCCTAAAT 19560 

CTGGTGCGTC TGCCAATTCC GCCAAACCCG CATATATGAC CCGTACTGGG CTCGAACCAG 19620 

TGACCCATTG ATTAAAAGTC AATTGCTCTA CCAACTGAGC TAACGAGTCT AAAATAACTT 19680 

GCGTTACCTT AAACGGTCCG ACGGAATCGA CCCGGTAC 19718 
(2) INFORMATION FOR SEQ ID NO: 100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4117 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

CCGTGGAAAA GTCTGGATAG TGAATGGTCT TCACACAATG ACCTGAAAGA AGCCTGAGAA 60 

TAATTATGGA GAGTAGCATT CTGAGAGGTG TTAGCAGAAC CATATGACAG AGCTGTTTGA 120 

AGAGGGAATA TTGAGGAGAA AAATCCTGAG CCTACCAGTT GGAGTTGGAA AGAGCTGACT 180 

GTTAGATCAT GGTTTATTAT CCACAACCTG TGGATAACTT TGTGAATAAG AGAAGTTGCT 240 

AAAGAAGGAG ATATATAACG ATGAAGAAAA TCAAACCGCA TGGACCGTTA CCAAGTCAGA 300 

CTCAGCTAGC TTATCTGGGA GATGAACTAG CAGCTTTTAT CCACTTCGGT CCTAATACCT 360 

TTTATGACCA AGAATGGGGG ACTGGACAGG AGGATCCTGA GCGCTTTAAC CCGAGTCAGT 420 
TGGATGCGCG TGAGTGGGTT CGTGTGCTCA AGGAAACGGG CTTCAAAAAG TTGATTTTGG 480 

TGGTCAAGCA CCACGATGGC TTTGTCCTTT ATCCGACAGC TCACACAGAT TATTCGGTTA 540 

AGGTCAGTCC TTGGAGGAGA GGAAAGGGCG ACTTGCTCCT TGAAGTATCC CAAGCTGCCA 600 

CAGAGTTTGA TATGGATATG GGGGTCTACC TGTCACCGTG GGATGCCCAT AGTCCCCTCT 660 

ATCATGTGGA CCGAGAAGCG GACTACAATG CCTATTATCT GGCTCAGTTG AAGGAAATCT 720 

TATCAAATCC TAACTATGGG AATGCTGGTA AGTTCGCTGA GGTTTGGATG GATGGTGCCA 780 

GAGGAGAGGG CGCGCAAAAG GTTAATTATG AATTTGAAAA ATGGTTTGAA ACCATTCGTG 840 

ACCTGCAGGG CGATTGCTTG ATTTTTTCAA CAGAAGGCAC CAGTATCCGC TGGATTGGCA 900 

ATGAACGAGG GTATGCAGGT GATCCACTGT GGCAAAAGGT GAATCCTGAT AAACTAGGAA 960 

CAGAAGCAGA GCTGAACTAT CTTCAGCACG GGGATCCCTC GGGCACGATT TTTTCAATCG 1020 

GAGAGGCAGA TGTTTCCATC CGTCCAGGCT GGTTCTACCA TGAGGATCAG GATCCTAAGT 1080 

CTCTCGAGGA GTTGGTCGAA ATCTACTTTC ACTCAGTAGG GCGAGGAACT CCACTCTTCC 1140 

TTAATATTCC GCCGAATCAA GCTGGGCTCT TTGATGCAAA GGATATTGAA CGACTTTATG 1200 

AATTTGCGAC CTATCGCAAT GAGCTCTATA AAGAAGATTT GGCTCTGGGA GCTGAGGTAT 1260 

CTGGTCCAGC TCTTTCCGCA GACTTTGCTT GTCGCCATTT GACAGACGGC CTTGAGACCA 1320 

GCTCTTGGGC AAGCGATGCA GACTTGCCCA TCCAGTTAGA ACTCGACTTA GGTTCTCCTA 1380 

AAACTTTTGA TGTAATTGAG TTAAGAGAAG ATTTGAAGCT AGGGCAACGA ATCGCTGCTT 1440 

TTCATGTGCA AGTAGAGGTG GATGGTGTCT GGCAGGAGTT TGGTTCGGGT CATACTGTTG 1500 

GTTACAAACG TCTCTTACGA GGAGCAGTTG TTGAGGCACA GAAGATACGT GTAGTCATTA 1560 

CAGAATCACA GGCTTTGCCT TTGTTGACCA AGATTTCCCT TTATAAAACT CCTGGATTAT 1620 

CAAAAAAAGA AGTTGTTCAG GAACTAGCAT TTGCAGAAAA AAGCCTAGCT GTGGCAAAGG 1680 

GAGAAAATGC CTATTTTACA GTTAAGCGCA GAGAATGTAG TGGTCCTTTA GAAGCTAAGA 1740 

TTTCGATTCA ACCGGGGACA GGTGTCCATG GTGTCGCCTA TCAGGATGAG ATTCAAGTCC 1800 

TTGCGTTTCA AACTGGTGAG ACTGAAAAAA GTCTGACGCT ACCAACCTTG TATTTCGCAG 1860 

GAGATAAAAC CTTGGATTTC TATCTGAACC TAACGGTGGA TGGTCAGCTT GTGGATCAAC 1920 

TTCAAGTCCA AGTTTCATAA AAGAAGAACC TTTGCGCGAT GCAAAGGTTC TTTTGGTTAT 1980 

TAGTGACTTG GTAACCAGCT GAGGGTGAAA GTTAGTTGTT CAGCTTTTAA GAGGTCTTGG 2040 

TGTTGAATAG TTGATACGAG TGTTTTGTCC AGTCGGCATT CTTTGACAAA GTTAAAATGG 2100 

TTGTGGTTTT GTTTAGTATG GATATCCAGC CATTTATCTT CTTTAGCGAG GTAGACTCGT 2160 
AGATGGTCAA AGAGAGGGAT TCCGAGGTCA TAGCTTGGTT TTCCTGGACA GGTTGGATAA 2220 

AATCCGAGAG CTGACCAGAT GTACCAAGCA GAGAGACTAC CATTGTCTTC ATCTCCAGGA 2280 

TAGGCTTCCC AACTTGGGTG AAAAGCTTTC TGACGGAGCG TCTTGATAAG AAGGGCAGTG 2340 

TAGTCAGGGT AATCGCTGTA ACGGAAGAGA TAAGGAATGT GGAAACTAGG CTGGTTGGAA 2400 

ATGGCTATTT GTCCAAAAGG AGCAGTAGCC ATCTCGCTCA TTTCGTGAAT TTCGTAACCA 2460 

TAGCCTGTTG TTTCAAAGAG GGGAGCATCT TGACAGGCTT TCAAAAGATA GTTGCTAAAG 2520 

GTTTCTTTTC CACCCATCAG TTGGATTAAG CCAGGGATGT CGTGGAGAAC GCCTAAAGTA 2580 

GCTTGAATGG CAGAGCATTC AGCGTAGTCT CGCCCCCAAC TATAAGGAGA GAAGTCAGGG 2640 

TGAAAGTTTC CTTGATTGTC TCGTGCTCGC ATGTAACCTG TCTCAGCGTC AAATAGCTGG 2700 

CGGTAATTTT GTGAAGCAGC CTTGTAGGTT TCAGCGATTT CTATGTTCTC TAGTTTTTTG 2760 

GCACAGCTGG CGATACAAAA GTCACTATAG GCATAGTCTA GAGTATGGCT AACACTTTCG 2820 

TGGTGGTCGG TAGAGAGGTA ACCTAGTTCT TGGTATTGGG CTAGTCCGTG GCGGCCATTG 2880 

ATGCCGAGAG GGTCGGCTTT GCTGGCTGTT TCGAGCATGG CTTGGAAGAG TTCTCCTTCT 2940 

AGGTCGGGGG TCATGTCCTT GCAGGCGCTA TCTGCGATAA TACCGTCTAA AAGTGTACCT 3000 

GGCATCATAC CCCGTTCATC TGGAGCCAGC CATTTTGGAA GGAAACCAGT ATCGCGGTAG 3060 

CTATTGAGGA AACCTTCTAA AAAGCGTTGA TAGTGCTCCG GTATGATAAG GGCAAAGAGG 3120 

GGGAAGGTGG TGCGGAAGGT ATCCCAGAAA CCATTGTTGC TAAAGAGGAC ACCAGGCTTG 3180 

ACAGTACCAG TAGCCAGATC CATGTGGATG GCTTGCCCTG ATTCATTAAT CTCATAAAAA 3240 

GTCTGTGGGA AGAGGAAGAG TCTGTAGAGG CAGTGGTCAA AGAAGGTTCG GTCAGCCTCT 3300 

CCTGTCTCTA TAATGTCAAA ACGATGGAGG AGATTTTCCC AATCCACTTG GGCACTTGAT 3360 

TTACAGCTAT CAAAATCTTC TTGAGGTAGA TTGATTAGAG CTTGAGAAGG AGAGATGAAA 3420 

GAAGTGGCTA GTTGCATCTC GGTTTGACTA CTTGCTAAGT CAATTCGCCA GTCTCCAGCT 3480 

TCTTGGCTGA TAGCAAGAAT ATCCGTGTTC ATTTGCAGGG CAGTGAACAT CGTTAGCGAA 3540 

TTTTTGTTAG TTTCAGTTTT ACCTTCTTGT CGCAGGGCAA GAGTCCGCTT ATCTACTTGC 3600 

TCTACTGTCA GTTCATCTGC TGCGTGAAGA TAGAGGGAGA GGGCTTTGCC TTGCTTTTGA 3660 

TTCAAACGAA TAGAAGCACC ATAGCAAGTC GGTGTGAGCT GGGTTTCAAT CTGATAACGC 3720 

AGAGAAAAGA GCTTCAAATA GTGAGGCTGG AAGCAAGCTT TATCTATATC ATAAGAAGAC 3780 

TGGCGGTGAA AGAGGCTGTC TCCCCCCAGT TGACTGGTGA CAGGTGTCAG AAGGAGCCAA 3840 

GAGTAGTCCC CAATCCAAGG ACTGGGCTGG TGAGTTAATC GAATCCCCTG AAAGATAGGC 3900 

AGATGTGGAT CAAAAAACCA AGATCCATCC TGGTCACTGG TCTGGGGCAC AAAGTAATTC 3960 
ATCCCAAAAG GCACGCCTGT GTATGGCAGG GTATTTCCCC GAGAAAAGGC ATGCTTGTTG 4020 

GTAGTTCCAA AACGGGTATC GATGGTATCA AGTAGTGGTT TCATAGTCTT TCCTTTAGCT 4080 

GTTTTTCTAC ATTATATCAG TAATAGAGGG CCTTTAG 4117 
(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2727 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CTGGTTCAAT TATTATTCAC TCTAAGTAGT CATATGTTCT TTATTTATGT GAGTTTTTAC 60 

CTTTTAAAGG ATCTTGTTAG ATGGGAGAAG GTTTTAAAAG TGACAGATGA TAATACAAGA 120 

AAAGTTCGTT TATTAGTAGC CTTTTTTAGC ATTGTCATAG GCTACATCCT GAGTTCTTTC 180 

TTTATTAGCC TGTATCATTT GTGGCAAGAA GCGCTTAGAG GATTATTATG AAATCAAGAG 240 

TAAAGGAAAC GAGTATGGAT AAAATTGTGG TTCAAGGTGG CGATAATCGT CTGGTAGGAA 300 

GCGTGACGAT CGAGGGAGCA AAAAATGCAG TCTTACCCTT GTTGGCAGCG ACTATTCTAG 360 

CAAGTGAAGG AAAGACCGTC TTGCAGAATG TTCCGATTTT GTCGGATGTC TTTATTATGA 420 

ATCAGGTAGT TGGTGGTTTG AATGCCAAGG TTGACTTTGA TGAGGAAGCT CATCTTGTCA 480 

ACGTGGATGC TACTGGCGAC ATCACTGAGG AAGCCCCTTA CAAGTATGTC AGCAAGATGC 540 

GCGCCTCCAT CGTTGTATTA GGGCCAATCC TTGCCCGTGT GGGTCATGCC AAGGTATCCA 600 

TGCCAGGTGG TTGTACGATT GGTAGCCGTC CTATTGATCT TCATTTGAAA GGTCTGGAAG 660 

CTATGGGGGT TAAGATTAGT CAGACAGCTG GTTACATCGA AGCCAAGGCA GAACGCTTGC 720 

ATGGTGCTCA TATCTATATG GACTTTCCAA GTGTTGGTGC AACGCAGAAC TTGATGATGG 780 

CAGCGACTCT GGCTGATGGG GTGACAGTGA TTGAGAATGC TGCGCGTGAG CCTGAGATTG 840 

TTGACTTAGC CATTCTCCTT AATGAAATGG GAGCCAAGGT CAAAGGTGCT GGTACAGAGA 900 

CTATAACCAT TACTGGTGTT GAGAAACTTC ATGGTACGAC TCACAATGTA GTCCAAGACC 960 

GTATCGAAGC AGGAACCTTT ATGGTAGCTG CTGCCATGAC TGGTGGTGAT GTCTTGATTC 1020 

GAGACGCTGT CTGGGAGCAC AACCGTCCCT TGATTGCCAA GTTACTTGAA ATGGGTGTTG 1080 

AAGTAATTGA AGAAGACGAA GGAATTCGTG TTCGTTCTCA ACTAGAAAAT CTAAAAGCTG 1140 

TTCATGTGAA AACCTTGCCC CACCCAGGAT TTCCAACAGA TATGCAGGCT CAATTTACAG 1200 
CCTTGATGAC AGTTGCAAAA GGCGAATCAA CCATGGTGGA GACAGTTTTC GAAAATCGTT 1260 

TCCAACACCT AGAAGAGATG CGCCGCATGG GCTTGCATTC TGAGATTATC CGTGATACAG 1320 

CTCGTATTGT TGGTGGACAG CCTTTGCAGG GAGCAGAAGT TCTTTCAACT GACCTTCGTG 1380 

CCAGTGCGGC CTTGATTTTG ACAGGTTTGG TAGCACAGGG AGAAACTGTG GTCGGTAAAT 1440 

TGGTTCACTT GGATAGAGGT TACTACGGTT TCCATGAGAA GTTGGCGCAG CTAGGTGCTA 1500 

AGATTCAGCG GATTGAGGCA AGTGATGAAG ATGAATAAGA AATCAAGCTA CGTAGTCAAG 1560 

CGTTTACTTT TAGTCATCAT AGTACTGATT TTAGGTACTC TGGCTCTAGG AATCGGTTTA 1620 

ATGGTAGGTT ATGGAATCTT GGGCAAGGGT CAAGATCCAT GGGCTATCCT GTCTCCAGCA 1680 

AAATGGCAGG AATTGATTCA TAAATTTACA GGAAATTAGG CTGGAGAACC AGCCTTTTTC 1740 

TAAAGATAAG GAGAAATATG AACAAAAAAA CAAGACAGAC ACTAATCGGA CTGCTAGTGT 1800 

TATTGCTTTT GTCTACAGGG AGCTATTATA TCAAGCAGAT GCCGTCGGCA CCTAATAGTC 1860 

CCAAAACCAA TCTTAGTCAG AAAAAACAAG CGTCTGAAGC TCCTAGTCAA GCATTGGCAG 1920 

AGAGTGTCTT AACAGACGCA GTCAAGAGTC AAATAAAGGG GAGTCTGGAG TGGAATGGCT 1980 

CAGGTGCTTT TATCGTCAAT GGTAATAAAA CAAATCTAGA TGCCAAGGTT TCAAGTAAGC 2040 

CCTACGCTGA CAATAAAACA AAGACAGTGG GCAAGGAAAC TGTTCCAACC GTAGCTAATG 2100 

CCCTCTTGTC TAAGGCCACT CGTCAGTACA AGAATCGTAA AGAAACTGGG AATGGTTCAA 2160 

CTTCTTGGAC TCCTCCAGGT TGGCATCAGG TCAAGAATCT AAAGGGCTCT TATACCCATG 2220 

CAGTCGATAG AGGTCATTTG TTAGGCTATG CCTTAATCGG TGGTTTGGAT GGTTTTGATG 2280 

CCTCAACAAG CAATCCTAAA AACATTGCTG TTCAGACAGC CTGGGCAAAT CAGGCACAAG 2340 

CCGAGTATTC GACTGGTCAA AACTACTATG AAAGCAAGGT GCGTAAAGCC TTGGACCAAA 2400 

ACAAGCGTGT CCGTTACCGT GTAACCCTTT ACTACGCTTC AAACGAGGAT TTAGTTCCCT 2460 

CAGCTTCACA GATTGAAGCC AAGTCTTCGG ATGGAGAATT GGAATTCAAT GTTCTAGTTC 2520 

CCAATGTTCA AAAGGGACTT CAACTGGATT ACCGAACTGG AGAAGTAACT GTAACTCAGT 2580 

AAAAGATACG CCTACACTCC TATGTCACTT ATGGATGTAG GAGTTCTTTT TACTAGTTTA 2640 

AGCAGGACTA AGACAGGTAC TAAGACAAAA TAGCAACTTC TAAAACTAAC TTCCAGTTTT 2700 

GGGAGAGAGA TGGAAGTTAC TTTGAGA 2727 
(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 

TTTTTTGTAG ATTTAAGTGG GGTGCAATTC CTAAAAAATA AAAAACAATT TTTGAAAATT 60 

ATGTTAGCAG GAATTGCTTC AAATTCGATT TTATCACTTA CAGGTTTACT TGTTTTATTG 120 

TTCACATCGT ATAAATTGCT TGGACTCTTA TTTTTTATCA TTAACTTAGG TATGATTTTT 180 

ATTAATTCAA TTCCTTTTTT TCAGTATGAT AGTGGTATTA TTTTAAGATA CTTGAATTCT 240 

AACAATAATA ACTTGAATTT TCAATATATA GTTCAACTTT TAATAGCATT TGTTATTATT 300 

TATTTTCCTT TGAGTCAACT ATTACAGTTT TTGACACCCA ATATTATTGT TCGTAGTATA 360 

GGAGGGGTGG TTGTTTCTAT ACTGCTTTCT ATATTATATA TGATAGGAAG GACGAAATAT 420 

GTTCTACGTA AATAGTTATG TTTTTGCTTA TAAAAAAGAA GGTATAATGT ATTTACGTGG 480 

TCGGAGTATG CGGGAAATAG CTATAGAACC TCAAATTTCG CAAGAATTTA TCAACGATCT 540 

ATTTAATAGT TGTAAGGAAC TATTAGAGAT AGAAGAAGTA TTAGGCAGTA AACTAACATT 600 

TGAACTATAA ATGAACAAAT TTTAATTTCG GATGAGATAG ATATTGATAG TAGATATTCT 660 

AGAACTAAAG GTTACTATTC GTTATTTTAT AATGAAGAGT ATAATAAAAT ACAGAATAAA 720 

ACAGTATTAG TATTAGGAGC AGGAGTCTTA GGATGTTATA TATCTCTAAG TCTAAGTATG 780 

TATGGAGTGA GGAAACTTAT TGTCGCTGAT TACGATATAA TAGAACCATC AAATTTAAAT 840 

AGGCAAATTC TTTATACAGA GTCGGATGTT GGTAAGGAGA AGATTAATGT TCTTTCTGAA 900 

AAAATACACA AGTATAATTC AGATGTTCAG GTAGTACCTA TTTCTATTAA AGTTTCTTCA 960 

GTAGAAGAAT TAGAAAAAAT TGTTGCGGAA TATGGGAGTA TAGATTTTAT CGTTAAAGCA 1020 

ATTGATACGC CCATTGATAT TATAAAAATT GTCAATCAAT TTGCTGTATC GCATAAGATA 1080 

TCCTACATAT CAGGAGGGTT TAATGGATGC TATCTTAITA TTGATAATAT ATATATCCCT 1140 

ACCATCGGTT CTTGCTTTGG TTGTCGGAAT ATAAACAAAG ATATAAATAA GTACACTTTA 1200 

TCTGATAAGA CAAAGTGGCC GACTACACCA GAGATGCCTG CTATTTTGGG AGGGATAATG 1260 

ACTAATTTAA TAATTAAAAT ATTTCTGGGA TGTTATAATG AAATCCTAAT AGATAACGCT 1320 

TACGTTTATA ATATGAGAAA TCATGCTCTA AGTCAAGAAA AATATGTTCT GGAAAACGGA 1380 

GAATGTCCAA TTTGTAAAAA AATAATAAAG TGAAAGATAA CAATATTAGA GCGAAAACAT 1440 

TTATTCGTTC AGTTTGTTTT TGCTTATTAT CAGGAGGAGT AGCTTTTTTA TCTGCTATTG 1500 

GGCAGTTCAC TGTTATAGAA ACACAATTAA TAGTATTGTT CTTGGGTATT ATTTTTGCTA 1560 

TATATTATGC TTACTACAAT AAAAATATTC AAACATCATT GGAAAATATA GTATGGCTTT 1620 
TTTCATCGTT TGAGATTTTA TTTTTGCTTG TTAATTTTAG AACATTTATT CAGTTACCAG 1680 

TGGATATTTT TATTGGTATG ATAATATTTT TAATGCTGTG GATATTTATT ATGTTAGGTA 1740 

TAGTGTGTCT TAGTTATTAT ATAACTTTAT TATTTAGCAA GGAGGCTTAG TATGTTTAAA 1800 

AAAATAGGTA TAATGAGCAT TTGCATATAT ATAATTATTT TATACTGCTT GAGAATGTAT 1860 

CGTATTATCA ATAATATTGA AACAATCTTG CTAACGGTTA TATGCTTAAT GTTATTGTTT 1920 

TTTTTAAGAC GTTTATTTGA TAAAGATAAG TAAATAGATG TTAAGTAAAA ATGTAGAATA 1980 

TAAAGGAGGT GCAATGAGTA TGATTGAAGT TAGCCATTTA TCAAAAAGTT TTGGTGATAA 2040 

AATAGCTTTA AATAATATAA GCTTCACTGT TAAAGAAGGT TAGATTTTTG GATTTTTAGA 2100 

ACCATCTGGT TCTGGAAAGA CCACAACGAT TAATATTCTG ACTGGGCAGT TCCTTGCCGA 2160 

TAAAGGACAA TCTATTATTT TGGGACAAAA ATCTCAAAAT TTAACAAGCG GTGAATTAAA 2220 

GAGAATTGGA TTGGTTAGCG ATACAAGTGG ATTTTATGAG AAAATGTCTC TGTATAACAA 2280 

TCTTCTTTTT TATAGTAAAT TTTATAATAT TAGTAAATCA CGTGTTGATA ATTTGTTAAA 2340 

GCGAGTAGGA TTATATGATA GTCGCAAGAT GGTAGCAGGA AAATTATCCA CTGGAATGAG 2400 

GCAACGAATG CTTTTAGCAC GAGCTCTTAT CAACAACCCC GCTGTACTCT TTCTGGATGA 2460 

ACCGACCTCA GGTCTAGATC CCACAACTTC TCGAACAATT CATGAGTTAA TTTTAGAATT 2520 

GAAAACAGCA GGGACAACGA TTTTTCTAAC GACTCATGAT ATGAATGAAG CAACTCTTTT 2580 

ATGTGATTAT GTTGCCTTAT TAAATAAAGG GAAATTAGTT GAGCAAGGAG CTCCTTCTGA 2640 

ACTCATTCAA AGATATAATA AAGATAAAAA GATTAAGGTT ACAGATTATA ATGGGAATCA 2700 

GATAACTTTT GATTTTACAT CACTAGAACA GGTATCTCAG ACTGATCTGG AAAATATTTT 2760 

TTCAATTCAT TCATGTGAGC CTACTTTAGA AGATATTTTT ATCACATTAA CAGGAGGAAA 2820 

GCTAAATGCT TAAACGGTTT CTGGCTTTGG TATGGTTGCG TTGTCAAATC ATCCTTTCCA 2880 

ATAAGAGTAT TTTATTGCAA GTTTTAGTGC CTTTTGCTTT CACATATTTT TATAAATATC 2940 

TTATGGAAAC ACAGGGGAAG GTCAACGATC AACAGGCATT AGTTCTTTTG ATGATGTGTT 3000 

TACCTTTTTC TTTTTCTTTG GCTGTTGGAA GTCCTATAAC TATTATCTTG TCTGAAGAAA 3060 

AAGAAAAGTA CAATTTACAA ACTCTTCTGT TGAGTGGTGT TAAAGGCTCC GAATACATTT 3120 

TATCAACTAT GTTTCTTCCT TTTTTGCTAA CTTTTGTGAT TATGGGAACT ACTCCTCTTA 3180 

TTTTAGGAGT TACAATTGTA CATACTTTTA ATTATATTAC AATCGTTCTT CTAACCTCTT 3240 

TATCCATCAT TTTATTCTAT TTATTGATAG GTTTAACCGC GAAGAGCCAA GTAGTAGCTC 3300 

AGGTTATCAG TCTTCCTGCT ATGATTTTAG TTGCTTTCTT ACCGATGCTA TCTGGTTTGG 3360 

ATAAGACAGT TGCGAAGATA ACAGATTATA GTTTTATGGG ACTATTTACT AAGTTTTTCA 3420 
CAAAATGGGA GGAATTTTCA TGGAATAAAA CTCTAATTCC TAATCTAACA CTACTTATTT 3480 

GGATTGTTCT TCTATTAACT TTAATTACGA TAACTATTAG GAAAAAGAAA ATTTCTTAAT 3540 

TGAGTTATTT TAATGATTAT AAACACAAGT GGGAAGGAAA AAATGAACTG ATCTTTTTGA 3600 

CAGCAATTCT ACAGAATAGT CTTATTGCTA TATTTTGATT TGAGTGTACG AAAAAAGAAA 3660 

AATAACAATA GTGCTCATAC TAATTGCAGA AGTTTTGGGT GATAAGATAA CTGATAAATT 3720 

GCAATAAAAA ATGCAACATT TTTAAATCTC CTCTATAAGT GCTTCAAAAA GTGCTTCAAA 3780 

ACCTGTCTTG TAATCCAAGT ATTTTTGGGG ACGGTGATTA ATAAGCTAGC AAAGCATCAT 3840 

TAAGGATTTT TTCGGTAATT GTTGCCAAAT CGGTTTAAGA AAATACTCAC GAAGAAGTCC 3900 

ATTCGCATTC TCATTACTTC CCCTTTGCCA AGATGAATAG GCATCCGCAA AATAAAACAG 3960 

AATTCCCATT TGTTCAATTA AAGGGTAACA AGCAAACTCT TTTTCTCTGT CCGAAGTGAA 4020 

AGTCTTTAAC TATTCTTTTG GAAAGAGTCT TGTGAGGTGT TCAATAGCAG TCAACATGGA 4080 

TTTAGCTGTT TTTACTTGAC AAGTGCTAGT AGAAATAATA GAATAGTAAA AAACCTTTAA 4140 

AGCAGTCCAG AGAGGCAGCT AAGGTTAGAC GGTGAAAGGG TGGAGACTAC CCATTTTTCG 4200 

TGGAACCTTG CTGTTGGCAG GTTCCTTTTT TCGTGGCTTC TGTTGGCCAG ACTCTCTCAC 4260 

TAGTAAAGGT AAAAGGAGAA ACCTATGCGA GAACATCGTC CAATCATTGC TCTTGATTTT 4320 

CCTAGTTTTG AGGCGGTCAA GGAATTTTTA GCTCTTTTCC CAGCAGAAGA AAGCCTTTAT 4380 

CTCAAGGTAG GGATGGAGCT TTATTACGCA GCGGGGCCTG AGATTGTGTC CTACTTAAAA 4440 

GGTTTGGGTC ATAGTGTCTT TTTGGATCTC AAACTTCATG ACATTCCTAA TACAGTCAAG 4500 

TCAGCCATGA AGATCTTGTC TCAGCTTGGT GTCGATATGA CTAATGTCCA TGCGGCTGGT 4560 

GGTGTAGAGA TGATGAAGGC GGCGCGTGAA GGTCTTGGGA GTCAAGCCAA ATTGATCGCT 4620 

GTAACTCAGC TCACATCAAC GTCAGAAGCT CAGATGCAGG AGTTTCAAAA TATCCAAACC 4680 

AGTCTGCAAG AGTCTGTGAT TCACTATGCC AAGAAGACAG CTGAAGCTGG CTTGGATGGT 4740 

GTTGTTTGCT CGGCTCAGGA AGTACAAGTC ATCAAGCAGG CTACCAATCC AGATTTTATC 4800 

TGTCTGACAC CAGGGATTCG TCCAGCTGGT GTTGCAGTTG GAGATCAAAA ACGAGTCATG 4860 

ACACCTGCTG ATGCCTATCA AATCGGCAGT GACTATATCG TAGTGGGACG TCCCATTACC 4920 

CAAGCTGAGG ATCCTGTTGC AGCTTATCAT GCCATCAAGG ATGAATGGAC ACAGGACTGG 4980 

AATTAAAGAA CTAGATTAGA AAAATAAAAG GAGAATACCA TGACACTTGC TAAAGATATC 5040 

GCTAGCCACC TCTTGAAAAT CCAAGCCGTT TACCTCAAAC CAGAGGAACC CTTCACTTGG 5100 

GCATCTGGTA TCAAGTCACC GATTTACACT GATAATCGTG TGACACTAGC CTATCCAGAA 5160 
ACTCGTACCC TAATTGAAAA TGGTTTTGTG GAAGCTATCA AAGAAGCCTT TCCTGAAGTA 5220 

GAAGTGATTG CAGGAACTGC AACAGCAGGG ATTCCACACG GAGCCATTAT TGCTGATAAG 5280 

ATGGACTTGC CTTTTGCCTA CATCCGTAGT AAACCAAAAG ACCACGGAGC TGGTAATCAA 5340 

ATCGAAGGTC GCGTAGCTCA AGGTCAAAAA ATGGTAGTGG TTGAAGACCT TATTTCAACG 5400 

GGTGGTTCAG TTCTTGAAGC TGTAGCAGCA GCCAAGCGAG AAGGAGCAGA TGTACTTGGA 5460 

GTTGTAGCGA TTTTCAGCTA CCAATTGCCA AAAGCAGATA AGAACTTTGC AGATGCTGGT 5520 

GTTAAACTTG TGACGCTTTC AAACTATAGC GAGCTTATCC ATCTAGCCCA AGAAGAAGGT 5580 

TACATCACGC CAGAGGGCCT TGATCTTCTA AAACGCTTTA AAGAAGACCA AGAAAATTGG 5640 

CAAGAAGGTT AGGTCAGTAA GATAAAGAGA GACGAGGCTA CCGAGTCTCT TTTACCATTT 5700 

TATTTAAAAT ATGACAG 5717 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

CCTGGACTTT CTAAAATGAA ATCTTGCGAC CTGGATCAAG CCCTTCATGA GCATTTTTCA 60 

GAAGAAGAAT TAGCTGGTCA CTTTCATGTC CTTCTATGGA CTTTTTTTAC AATGGCATTG 120 

CTATCACACC CAATACCTAT CTAAGCGCCT GGTTCGTAAA CTTTATTGCA GCTCTTCCTC 180 

TAAATTTCCT AATTGTTGAA CCAATTGCCC GTTTTATACT AAGTTCTTTT CAGAAACCAT 240 

TTACTGGGGA AGAAGTTGAA GATTTTCAAG ATGATGATGA AATCCCAACT ATTATCTAAG 300 

CCAGTTCTGT AAACTACTAA TATTTGAAAT CCACTTCCTT TTAGGGTGCA ATGGTTATAA 360 

ATGAATTTTT GAGAGGATCA GAATGAAAAA ACTAGCAACC CTTCTTTTAC TGTCTACTGT 420 

AGCCCTAGCT GGGTGTAGCA GCGTCCAACG CAGTCTGCGT GGTGATGATT ATGTTGATTC 480 

CAGTCTTGCT GCTGAAGAAA GTTCCAAAGT AGCTGCCCAA TCTGCCAAGG AGTTAAACGA 540 

TGCTTTAACA AACGAAAACG CCAATTTCCC ACAACTATCT AAGGAAGTTG CTGAAGATGA 600 

AGCCGAAGTG ATTTTCCACA CAAGCCAAGG TGATATTCGC ATTAAACTCT TCCCTAAACT 660 

CGCTCCTCTA GCGGTTGAAA ATTTCCTCAC TCACGCCAAA GAAGGCTACT ATAACGGTAT 720 

TACCTTCCAC CGTGTCATCG ATGGCTTTAT GGTCCAAACT GGAGATCCAA AAGGGGACGG 780 

TACAGGTGGT CAGTCCATCT GGCATGACAA GGATAAGACT AAAGACAAAG GAACTGGTTT 840 
CAAGAACGAG ATTACTCCTT ATTTGTATAA CATCCGTGGT GCTCTTGCTA TGGCTAATAC 900 

TGGTCAACCA AACACCAATG GCAGCCAGTT CTTCATCAAC CAAAACTCTA CAGATACCTC 960 

TTCTAAACTC CCTACAAGCA AGTATCCACA GAAAATTATT GAAGCCTACA AAGAAGGTGG 1020 

AAACCCTAGT CTAGATGGCA AACACCCAGT CTTTGGTCAA GTGATTGACG GTATGGATGT 1080 

TGTGGATAAG ATTGCTAAGG CCGAAAAAGA TGAAAAAGAC AAGCCAACTA CTGCTATCAC 1140 

AATCGACAGC ATCGAAGTGG TGAAAGACTA CGATTTTAAA TCTTAAAAAC CAAAAAAATA 1200 

CAGTATCCAC ATTCGGTAGT GTATTTCTTT TACTCTCATT CTTAAGTTAA ATTATTAAAA 1260 

TCCCATATTT GGTCTATCCA GCCTTCATAA AAGTCTGGCT CGTGGCAGAC CATAAGGATA 1320 

GATCCCCTAT ATTCTTTGAG AGCGCGTTTG AGCTCATCCT TTGCATCCAC ATCCAAATGG 1380 

TTGGTCGGCT CGTCCAGCAC TAAAACGTTG TTTTCACGAT TCATCAAGAG ACAGAAACGA 1440 

ACCTTGGCTT GCTCTCCCCC TGATAATACT TGAATCTGGC TTTCAATATG TTTGGTTGTC 1500 

AAACCACAAC GGGCAAGGGC TGCACGGACT TCTGCTTGAT TAAGGGCAGG AAAGGCATTC 1560 

CAGACAGCTT CAAGAGGAGT TTGGCGATTA CCGCCTTCTA CTTCCTGCTC AAAATAACCA 1620 

AGTTCTAAAT AATCTCCACG CTCCACTTCC CCAGCGATTG GCGAGATAAT GCCCAAGAGA 1680 

CTCTTCAAGA GAGTTGTTTT TCCAATACCA TTAGCACCAA TAATCGCAAC CTTTTGATTG 1740 

CGTTCGAAGG TAAGATTTAA AGGCTTAGTA AGAGGACGGT CGTAACCAAT TTGCAAGTTC 1800 

TTGGCTTGGA AGATAAAGCG CCCTGGTGTA CGAGCTGGTT TGAAATCAAA GGATGGTTTT 1860 

GGTTTCTCAC TTTGGAGTTC GATAATATCC ATCTTATCCA ATTTCTTTTG ACGAGACATA 1920 

GCCATATTAC GAGTTGCAAC ACGGGCTTTA TTACGAGCCA CAAAGTCCTT GAGGTCTGCA 1980 

ATCTCTTTCT GCTGGCGTTC GTAGGCTGCC TCTAGCTGAG ATTTCTTCAT AGCATAAACT 2040 

TCTTGGAACT GGTAGTAGTC ACCAGAGTAA CGCGTCAGCT GTTGATTTTC CACATGATAG 2100 

ACAATATTAA TAACGTCATT GAGGAATGGA ATATCGTGCG AAATGAGAAC AAAGGCATTC 2160 

TCATAGTTTT GGAGATAGCG CTTGAGCCAA TCAATATGCT CAGCATCCAA GTAGTTGGTC 2220 

GGCTCGTCCA ACAGCAAGAT ATCAGGCTTT TCAAGGAGAA GTTTTGCCAA AAGCACCTTG 2280 

GTTCTTTGCC CACCTGACAA AGAAGTTACA TCCGTATCCA TGCCAAAGTC CATAACACCA 2340 

AGAGCACGCG CTACTTCGTC AATCTTAGCA TCCAAGGTAT AGAAATCACG ACTCTCCAGA 2400 

CGGTCTTGAA GTTCTCCTAC TTCTTCCATG AGAGCATCAA CATCCGCGCC GTCTTCAGCC 2460 

ATTTTCATAT AGAGGTCATT GATACGAGCT TCAGCTTTGA AAAGCTCATC AAAAGCCGTA 2520 

CGGAGAACAT CACGCACCGA CTGTCTTTCA GCAAGGACAG AGTGCTGATC CAAGTAACCA 2580 
GCCGTCACAT ATTTGGACCA CTCAACCTTT CCTTCATCTG GCAGCATTTT ACCAGTCACG 2640 

ATACTCATAA AGGTTGATTT TCCTTCACCA TTGGCACCGA CCAGGCCGAT ATGTTCTCCC 2700 

TTGAGGAGAC GGAAGGACAC ATCTTCAAAA ATTGCACGGT CACCAAAACC GTGACTCAGA 2760 

TTTTTAACTT CTAAAATACT CATTTTAATT CCTTACCTTG TTTTTATGTA ATCGTTTATA 2820 

AAGGAGCCAA GCCAGATAGC CACCCAAAGT GTTGGTCCAC AAATCATCAA TCTCAAAGAC 2880 

GCGATTGAAA TCAAAGAAAA AGTCCAAGAT TAATTGCGTA CACTCGATTC CAAGACTCAC 2940 

AAGAAAACTA AAAAGAAGGA CCTTTTTTGT TTTCCGCAAA TTTGGAAATA GATAAAGGAG 3000 

TTGGAAAATC AGAGGAAAAA ACAAGAAGAC ATTGAGGATA TTTTGTAAAA AAATCCAACA 3060 

TAATTGTCCA ATGTCACTCA CTTCGCCCAG TTTCCAGAGA GAATTGAAAG GAGTCAAAAG 3120 

AAAAACCAGG CGTCCAAGAT GCTGAATACC TGGAGTTCCC ACTCCCACGG TAGATTGTTC 3180 

TTGAGGAGTA AAGCAAAAAC AGACAATGCA AATGCTATAG AAAATRACTC CCCAGACCAA 3240 

AATATGATTA TAAGTCTTCT TCATCATTAA GGATTTACCG CTGCGACTGC CTTCTGGCGG 3300 

TCACGTTTCA TTGTGTTAGA GCGCAATTGT CCACAAGCTG CGTCAATATC TGTACCATGC 3360 

TCTTGACGAA CCACACAGTT GACCCCTTTT TTCTTAAGCG TATCATAGAA AGCCAACACG 3420 

CACTCTTTGG GACTACGGCT ATATTGGTCA TGCTCACTAA CTGGGTTATA AGGAATCAAG 3480 

TTTACATAAG ACAATTTCTT GATGTTCTTG AGCAATTCAG TCAATTCCAA GGCTTGTTCT 3540 

ACACCGTCGT TGACTTCATT AAGCATGATA TATTCAAAGG TTACACGACG GTTTGTTGTC 3600 

TCAATGTAGT ATTCAATAGC AGCAAAGAGT TTTTCAATCG GAAAGGCACG GTTAATCTTC 3660 

ATGATACTTG AACGAAGTTC ATTGTTAGGT GCGTGAAGAG ACACGGCAAG ATTGACCTGA 3720 

ACCCCTTCAT CAGCAAAGTC ACGAATTTTA TGAGCCAAAC CTGAGGTTGA AACCGTGATG 3780 

TGACGAGCAC CGATAGCCAT TCCTTTATCA TCATTGATAG TACGAAAGAA ATTCAAGACA 3840 

TTGTTGTAAT TATCAAAGGG CTCACCGATT CCCATGACAA CGATATGGCT GATGCGTTCA 3900 

TCCTGACCAC GCTCATCAAA GTATTTCTGA ACCAGCATGA TTTGCGCTAC GATTTCACCG 3960 

TTATTGAGGT CACGTTGCTT CTTAATCAAA CCAGAGGCAC AGAAGGTACA ACCGATATTA 4020 

CAGCCGACCT GAGTGGTCAC ACAGACAGAT AAACCATAGT GTTGACGCAT GAGTACAGTC 4080 

TCAATTAACA TACCGTCGGG CAATTCAAAG AGATATTTGA CTGTACCATC AGCAGACTCT 4140 

TGCACAATAC GTTGTTTCAA GGGATTGACC ACAAACTGGT CATTGAGCTT AGCAATCAAA 4200 

TCCTTGGAAA GGTTGGTCAT TTCTTCAAAT GACTGCACAC GTTTACGGTA GAGCCATTCC 4260 

CAGATTTGAT CTGCACGGAA TTTCTTTTCT CCCTGCTCCA ATACCCATTC CTGCATGGTT 4320 

TGATGTACCA AACTATGAAT TGAGGGTTTC ATTTCTTCTC CTTATTCTCT ACTCACTTCT 4380 
GACGAATGAC AAAATGACGT TGTCCCTTGT CGTCTTTCTG ACGACGTCTA TTTTTCTTAT 4440 

CTGCATTCGA CTTTCGTTTA GTTTGAGTCG GTTTCTTTCC TTTTCTAGAA GGTGTTTCTT 4500 

CTTCCGTCTT ACGCATTTTC TTGTCAAATG ATGCTCGCTT AGGGGCTTCA TTTTCTAAGA 4560 

CAAAATAGGC ACAACCATAA CTACAATACT CTAAAAGGTA GTCTTGTAAA CGACTGATTT 4620 

TTTCAAGTTT TTCTTCTGTT CGGTCATCCT TGTAAAAACC TCGTAGGCGA AGCTGTTCGT 4680 

TGCTCCAGTC CCCCACGATA TAATCAAACT TGGTTAATAC TTCTGAAAAA CGCTGATTAA 4740 

AAGTCGTCAC ATCAAAGGCA TCCTTGATAT TTTCAACCAA GGAAAAAGCT ATCCCTTCCG 4800 

TTTCGACCTT GTCCCCGTGT AAATGGAACT CCGGACCAGG AAACTTGTTA TAGTTGTATA 4860 

ATTCAGGTGC AATTTCTTTT CGCATAGATA TCCTTTTTTC ACGATTACTT AATACTTTAT 4920 

TCTACCATAA TTTCTAGCAG TTAGCACGTT TCTCATAAAA ATGAAAAAAG TCTGACGATT 4980 

TTGTCAGACC AGAATCTTAT AACCTAAAAA GAGAAGAACA ATTCTTCCCT CCAACTATCA 5040 

TTATTTAGCA GCTGCGTACA ATTCATCTAC TTTATTCCAG TTGATTACTG AAAAGAAAGC 5100 

TTTGATGTAG TCAGGACGCA CGTTGCGGTA TTTCACGTAG TAAGCATGTT CCCAAACGTC 5160 

CAAGCCCAAG ATTGGTTTTT TACCTTCTGA GATTGGTGTG TCTTGGTTTG CTGTTGAAGT 5220 

CACTTCAAGT TTCCCTTCTT TGTTGACAAC CAACCATGCC CAACCTGAAC CAAAACGAGT 5280 

TGTTGCTGCT GCAGTGAAGG CTGCTTGGAA TTCTTCAAAT GAACCAAATG TTGCATCGAT 5340 

TGCTGCTGCC AGTTCTGCTG AAGGAGCTGT TTTCTCGGGA GTCATCAATT CCCAGAAAAG 5400 

AGCGTGGTTC AAGTGTCCGC CACCATTGTT GATAAGTGCT TGACGGATAT CAGCTGGGAT 5460 

AGATTCTACA TCAGCAAGCA AGGCTTCAAG GTCTTCACCG ATTTCAGGGT GTTTTTCTAA 5520 

AGCTGCATTG GCATTGTTGA CATAAGTTTG ATGGTGTT 5558 
(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6735 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

GGAATTGTAA ATATCATATT GTTTTTGCAC CCAAATATCG TCGTCAAATC ATTTATGGCA 60 

GATACAAAGC TAGTATCGGA AGAATCATAC GTGACTTATG TGAGCGTAAG GGTGTAATAA 120 

TCCATGAAGC GAATGCTTGT TCAGACCATA TTCACATGCT TATCAGTATT CCTCCGAAAC 180 
TTAGTGTTTC GTCCTTTATG GGCTATTTAA AGGGCAAGAG CAGTTTGATG ATTTTTGATA 240 

AGCATGCGAA TTTAAAATAC AAATATGGCA ATCGCAAGTT TTGGTGTAGA GGCTATTATG 300 

TAGATACGGT AGGCCGTAAT CAGAAAGTGA TAGCTGAATA TATTCAGAAT CAATTACAAG 360 

AAGACAGAGT AGCAGACCAG CTCACGTTAT TCGAGTCAGT AGATCCGTTT ACTGGCGAAA 420 

TAAATAAGAG GAAGTAACTA AGGTGCTTTA GCACCTGCTC GGGAAAGTGG TGCGCGAGGA 480 

AGCTATTTCG GTGGGCCTTT GGCCCTGGCC GGTAGAAGCG GCTTATAGCC GCAGAACAAA 540 

CCACCAGTTC ACACTGGTGG TTTTGATTTA AAAAACTTGA TACATAAAAA TAAAAGTCTA 600 

TATAAAGGAT GGTAAAATTC CTGTTGTCCG ATTTGGACAA TATCCTAAAT AGTTACAATA 660 

TATGGTCTAT ACTTTTTCTT AGGAGAAAGC TAGATGTACA GACGTTTGAG AGATTTGAGG 720 

GAGGATCATG ATCTGCCCCA AAAGCAAATA GCTACAATAC TTTCGTTTAC AAATTCAGCT 780 

TATGCCAAAA TTGAACGGGG TGAGCATGCG TTGACGGCTG ATGTATTGGT TAAACTCTCA 840 

GATTTCTATG ACGTCAGTAC AGACTATTTA TTGGGATTAA CTGATTTTCC TGATAAAATT 900 

CGCTTTAGAA AATAATCTCC TCAATTTCAT AGAGTTTGAA AATGAGTGAG ATTTTTTATT 960 

TGCCCTTTGA CAACTGAATA GCCTAAAATG GTACTTTCCT CATTTGTGGA GCAAATTTGA 1020 

ATGGCTCGCC ATGATAAGAG CGATTTTAAA ATCATCAATA AAATAGAGCG ATACTTTATA 1080 

TGCCATGATA CAAATGATAT ACAATGATAC TTCTGACCGT TCAGCCTGCC AACGTAAAAG 1140 

AGCAGCAAGT GAAATTCTTA TGATGACTTC ATCAGTCATG CCACGTTGAA TGTGTGAGTT 1200 

TGTTAGATAA ACGCAATTAA TCCTCAAAAG GTTCCCCGAA CCTTTTGAGT TCTACAGACG 1260 

CATCACGTGG AGTGTGTAAG CTTGTTGCTA AAAGCGTAAA AACCTTGGAA CGAAAGGAAT 1320 

AATAGACTTT CTGCGAAACA AAAATATAAT ACAATAAAAC TATGAATGAT GAAGCAAGTA 1380 

AACAATTGAG CGATAGCCGT TTCAAGATCC TTGTAGGTGT TCAGCGCACG ACTTTTGAAG 1440 

AGATGTTAGC TGTGTTAAAA ACAGCTTATC AACGTAAACG CGCAAAAGGT GGACGAAAAA 1500 

GCAAATTAAG CCTAGACGAT CTCCTTATGG TAACTATTCA ATACATGCGA GAATAGAGCA 1560

CTTATGAACA AATTGCGGCT GATTTTGGCA TTCACGAAAG CAACTTAATC CGTCGGAGTC 1620

AATGGGTTGA AGCAACTCTT ATTCAAAATG GTTTTACCAT TTCAAATTCT GCCTTAATTC 1680

TGTAAAAACA GTAAAATTCG AAGGATTGTA AGGTAAGAGT TTTTTTCTTT CTGAAAAAAT 1740 

GGTATAATAG CAATCAAAAC TAGAAAATAA AACGGAATTT GGAACAGATT TGTCTGTATC 1800 

CTAGTAGAGT GGTGATACTA TGAAGATTAG TAAGAGGCAC TTATTAAATT ATTCCATCTT 1860

GATTCCCTAC TTGCTTTTAT CTATTTTGGG CTTGATTGTG GTCTATTCGA CCACCAGTGC 1920

TATTTTAATT GAAGAAGGCA AGAGCGCCTT GCAGTTGGTT CGAAACCAAG GAATCTTTTG 1980 
GATTGTTAGT TTGATACTGA TTGCCTTAAT TTATAAATTG AGACTAGATT TTTTGAGAAA 2040

TGAGCGACTA ATCATTTTAG TTATATTAAT AGAAATGCTT TTATTGTTCT TGGCTCGTTT 2100 

TATTGGTATT TCCGTAAACG GGGCATACGG TTGGATTTCG GTTGCAGGAA TAACTATTCA 2160 

GCCAGCTGAG TACTTAAAAA TCATTATTAT TTGGTATTTA GCTCACCGAT TCTCCAAACA 2220

GCAAGAAGAA ATAGCTACTT ATGATTTTCA AGTTTTGACT CAAAATCAAT GGCTTCCCCG 2280

TGCTTTTAAT GATTGGCGAT TCGTTCTCCT AGTTCTGATT GGAAGTTTGG GAATTTTCCC 2340

TGATTTAGGA AATGCGACTA TTTTAGTCTT GGTTTCCTTG ATTATGTATA CAGTTAGTGG 2400 

AATCGCTTAT CGCTGGTTTT CAACCATTCT GGCGCTCGTA TCTGCCGCTT CTGTCTTTGT 2460

CTTGACCACT ATCAGCCTAA TCGGTGTTGA GACCTTTTCA AAAATTCCAG TATTCGGCTA 2520

TGTAGCCAAG CGCTTTAGTG CCTTTTTTAA TCCTTTTGCC GATCGTGCTG ATGCAGGTCA 2580 

CCAGTTAGCT AATTCTTATT TTGCCATGGT CAATGGCGGT TGGTTTGGTC TAGGTCTTGG 2640

AAACTCGATT GAAAAACGAG GTTATTTGCC AGAAGCTCAT ACAGACTTTG TCTTTTCTAT 2700

CGTGATTGAA GAATTTGGCT TTGTTGGTGC CAGTCTTATT TTAGCTCTCT TGTTTTTCAT 2760

GATTTTGCGG ATTATCTTGG TCGGTATCCG AGCGGAGAAT CCTTTCAATG CCATGGTTGC 2820

ACTCGGTGTC GGAGGGATGA TGTTGGTTCA GGTATTTGTC AATATCGGAG GGATTTCGGG 2880 

CTTGATTCCA TCTACAGGAG TGACTTTCCC CTTCTTATCC CAGGGTGGAA ATAGTCTTCT 2940

AGTCTTATCA GTGGCAGTAG CCTTTGTCTT AAATATTGAT GCCAGTGAAA AACGCGCTAA 3000 

ATTGTACCGA GAATTGGAAA ATCAACCAAT GAACCTTCTG TTGAAGTAGG ATAAAGAAAG 3060 

GATAGTTTAT GTCTCTTCAA AAATTAGAAA ATTATAGTAA TAAAAGTGTT GTGCAAGAAG 3120

AAGTCTTGAT TCTAACAGAA TTACTGGAAG ATATTACTAA AAATATGCTT GCCCCAGAGA 3180

CCTTTGAAAA AATAATACAG TTGAAAGAAT TATCAACGCA GGAAGATTAT CAAGGTCTAA 3240

ACCGTCTAGT GACTAGCTTA TCAAATGATG AAATGGTCTA TATTTCACGC TATTTCTCTA 3300 

TCTTGCCTCT TTTGATTAAT ATTTCAGAGG ATGTGGATTT AGCTTATGAA ATCAATCATC 3360

AAAATAATAT TGATCAGGAC TATTTAGGTA AATTATCTAC AACGATTAAA TTGGTAGCAG 3420

AAAAGGAAAA TGCCGTTGAG ATCCTAGAAC ACTTGAATGT TGTCCCTGTT TTGACAGCCC 3480 

ATCCAACACA AGTGCAACGC AAAAGTATGT TGGATTTAAC AAATCATATT CATAGTCTTT 3540

TGCGTAAATA CCGTGATGTT AAGTTGGGGT TGATCAATAA AGATAAATGG TACAATGATT 3600

TGCGTCGTTA CATCGAAATT ATCATGCAGA CAGACATGAT TCGTGAGAAA AAATTAAAAG 3660

TGACTAACGA AATCACGAAT GCTATGGAAT ATTATAACAG CTCCTTTTTG AAAGCTGTAC 3720 
CTCATTTGAC GACGGAGTAT AAGCGCTTAG CGCAAGCGCA TGGTCTGAAT TTAAAACAGG 3780

CTAAACCAAT CACCATGGGT ATGTGGATAG GTGGTGACCG TGATGGAAAT CCATTTGTTA 3840

CAGCAAAGAC CTTGAAGCAG TCTGCACTCA CTCAGTGTGA AGTCATCATG AACTACTATG 3900 

ATAAAAAGAT TTACCAACTT TATCGTGAAT TTTCTCTTTC AACTAGCATT GTCAACGTCA 3960 

GCAAGCAAGT CAGAGAAATG GCTCGTCAAT CCAAGGATAA CTCGATTTAC CGCGAAAAAG 4020

AGCTTTACCG TCGTGCCTTG TTTGATATTC AATCAAAAAT TCAGGCAACT AAAACCTATC 4080

TGATTGAGGA TGAAGAAGTT GGGACTCGTT ATGAAACCGC CAATGATTTC TACAAGGATT 4140 

TGATTGCCAT TCGAGATTCT CTACTAGAAA ATAAGGGCGA GTCCTTGATT TCAGGTGATT 4200 

TTGTGGAATT ATTGCAGGCA GTAGAGATAT TTGGTTTTTA CTTAGCATCA ATTGATATGC 4260

GACAAGACTC TAGCGTCTAT GAAGCCTGTG TGGCAGAACT CTTGAAATCA GCAGGAATTC 4320 

ATTCTCGTTA TAGCGAGTTG AGCGAAGAAG AAAAGTGTGA CCTTCTCTTG AAAGAATTAG 4380

AAGAAGATCC CCGAATTCTT TCTGCGACTC ACGCAGAAAA ATCAGAATTA TTAGCAAAAG 4440

AATTAGCTAT TTTTAAGACG GCTCGTGTTT TGAAAGATAA GTTGGGAGAT GATGTCATCC 4500 

GTCAGACCAT CATTTCACAT GCAACCAGCC TTTCTGATAT GCTAGAATTA GCTATTCTGT 4560 

TAAAAGAAGT AGGACTGGTG GATACGGAAA GGGCGCGTGT TCAGATTGTT CCCCTTTTTG 4620 

AAACAATTGA AGACTTGGAT CATTCAGAGG AAACAATGAG AAAATATCTT TCTCTTAGCC 4680

TTGCCAAAAA ATGGATTGAC TCACGAAATA ACTACCAAGA AATCATGCTT GGCTACTCTG 4740 

ACAGTAATAA AGATGGCGGT TACTTGTCAT CATGTTGGAC CCTCTACAAG GCTCAACAAC 4800 

AATTGACTGC TATTGGAGAT GAATTTGGCG TTAAGGTTAC CTTCTTCCAT GGTCGTGGTG 4860 

GTACTGTCGG TCGTGGTGGT GGGCCAACCT ATGAAGCCAT TACATCTCAA CCGCTCAAGT 4920 

CTATCAAGGA TCGTATCCGC TTGACGGAGC AGGGTGAAGT AATTGGGAAT AAATACGGTA 4980

ACAAAGACGC CGCTTACTAT AACCTTGAAA TGCTAGTATC GGCAGCTATT AACCGTATGA 5040 

TTACTCAGAA GAAGAGCGAT ACCAATACCC CAAATCGTTA TGAAACCATT ATGGATCAAG 5100 

TAGTGGACCG TAGTTACGAT ATCTACCGTG ATTTGGTCTT TGGTAATGAG CATTTCTATG 5160 

ATTATTTCTT CGAGTCAAGT CCAATCAAGG CTATTTCAAG TTTTAATATT GGTTCTCGTC 5220 

CAGCCGCTCG TAAGACTATT ACTGAAATCG GTGGTTTGCG TGCCATCCCT TGGGTATTCT 5280 

CATGGTCACA GAGTCGTGTT ATGTTCCCTG GATGGTACGG GGTTGGTTCA AGCTTCAAGG 5340 

AATTTATCAA TAAAAATCCA GAGAATATTG CTATCTTACG AGATATGTAC CAAAATTGGC 5400 

CTTTCTTCCA ATCGCTTCTT TCAAATGTTG ATATGGTTTT GTCAAAATCA AATATGAATA 5460 

TTGCTTTTGA ATATGCTAAA CTTTGTGAAG ACGAGCAAGT TAAGGCCATC TATGAGACTA 5520 
TTTTAAATGA ATGGCAAGTT ACTAAGAACG TTATCTTGGC TATTGAAGGA CATGACGAAC 5580

TCTTAGCTGA CAATCCATAT CTAAAAGCTA GTCTGGATTA CCGTATGCCT TACTTTAATA 5640

TTCTCAACTA TATTCAGTTG GAGTTGATTA AACGCCAACG TCGTGGAGAA TTGTCCAGTG 5700

ATCAAGAACG ATTGATTCAT ATCACCATCA ACGGAATTGC GACAGGATTG CGTAATTCAG 5760

GTTGATAATT TTCAAGAGTG AATGCTAAAA GTGAATATCA AAAAAATTCT AATAGACTAT 5820

TGACAAGTAG TTTAAAAATG ATATAATTTA ACCATTCAGA AAAGTAATCA TACAAACTTT 5880

TTAGAGAGTC TGTGGTAGCT GAAAACAGAT AAGTGGCAAT GATGAAAATT GGGCTGAATG 5940 

CTATTTAGAA TTTGAAATTA TAAAAATTCG GTAAGCACAC CTTACAGTGC ATCTCGTTAT 6000 

TGCGAGACTG AGCGATAGGG AAATTCCCTA TAATTGAGGT GGTACCGCGC ATCGACGTCC 6060

TCACACAAGT TTTTTGTGTG AGGATTTTTT TGATGGAGGT TAGTATGGAA AGAAAACGAT 6120

GGCGTCGCTT GTTTAGATAA GTGAAATATG TTAAAGGAAA TAAAAAGGAG AAACAGAATG 6180 

AAAAATAAAC GTTTAATTGG AATTATTGCT GCATTAGCAG TCTTAGTAGC AGGAAGCTTG 6240

ATTTATTCTT CAATGAATAA ATCAGAAGCT CAGAATAATA AGGATGAGAA GAAAATAACC 6300

AAGATTGGTG TGCTTCAATT TGTGAGCCAT CCATCCCTTG ATTTGATTTA TAAAGGGATC 6360

CAAGATGGAC TTGCAGAAGA AGGATATAAA GATGATCAAG TTAAAATTGA TTTTATGAAC 6420 

TCAGAAGGTG ACCAAAGTAA GGTTGCGACA ATGAGTAAAC AATTGGTTGC AAATGGGAAT 6480

GACCTTGTGG TTGGTATCGC AACACCAGCA GCCCAAGGGT TGGCTAGTGC AACAAAAGAC 6540 

CTACCGGTTA TCATGGCCGC TATTACAGAC CCAATTGGTG CTAACTTGGT TAAAGATTTG 6600 

AAAAAACCAG GTGGCAACGT TACAGGGGTA TCTGACCACA ATCCAGCTCA ACAACAAGTT 6660

GAACTCATCA AGGCTCTGAC ACCGAATGTG AAAACAATCG GAGCTCTTTA CTCAAGTAGC 6720

GAAGACAATT CAAAA 6735 
(2) INFORMATION FOR SEQ ID NO: 105:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 

CTAGAGGATC CCAGCAGGTA AATTGGCTTC AGCTGGCAAA AAAGTTGCCC TCGTTGAACG 60

CAGCAAGGCT ATGTACGGTG GAACTTGTAT CAACATTGGT TGTATCCCAA CTAAAACCTT 120 
GCTAGTTGCT GCTGAAAAGG ACTTGTCTTT TGAAGAAGTC ATTGCTACTA AAAACACGAT 180

CACTGGTCGC CTCAACGGTA AAAACTATGC GACTGTTGCT GGTACAGGCG TAGATATCTT 240

TGATGCGGAA GCTCACTTCC TTTCAAATAA AGTCATCGAA ATCCAAGCTG GTGATGAAAA 300 

GAAAGAACTG ACTGCTGAAA CAATCGTCAT CAACACTGGT GCTGTTTCAA ACGTCTTGCC 360

AATCCCTGGA CTTGCTACAA GCAAAAACAT CTTTGACTCA ACAGGTATCC AAAGCTTGGA 420

CAAATTACCT GAAAAACTTG GAATCCTTGG TGGCGGAAAT ATCGGTCTTG AATTTGCCGG 480 

CCTTTACAAC AAACTTGGAA GCAAGGTCAC AGTCCTAGAT GCCTTGGATA CATTCCTACC 540

TCGTGCAGAA CCTTCCATCG CAGCTCTTGC TAAACAATAC ATGGAAGAAG ATGGCATTGA 600 

ATTGCTTCAA AATATCCATA CTACTGAAAT CAAAAACGAT GGTGACCAAG TGCTTGTCGT 660 

AACTGAAGAC GAAACTTACC GTTTCGACGC CCTTCTCTAC GCAACTGGAC GCAAACCAAA 720 

TGTAGAACCA CTTCAACTTG AAAATACAGA TATTGAACTA ACTGAACGTG GTGCTATTAA 780 

AGTAGACAAA CACTGTCAAA CAAACGTTCC TGGTGTCTTT GCAGTTGGAG ATGTCAACGG 840 

TGGCCTTCAA TTTACTTACA TTTCACTTGA TGACTTCCGT GTTGTTTACA GCTACCTTGC 900 

TGGAGATGGC AGCTATACAC TTGAAGACCG TCTCAATGTG CCAAATACTA TGTTCATCAC 960 

ACCTGCACTT TCACAAGTTG GTTTGACTGA AAGCCAAGCA GCTGATTTGA AACTTCCATA 1020 

CGCTGTTAAG GAAATCCCCG TTGCAGCAAT GCCTCGTGGT CACGTAAATG GAGACCTTCG 1080 

CGGTGCCTTC AAAGCTGTTG TCAATACTGA AACAAAAGAA ATTCTTGGAG CAAGCATCTT 1140 

CTCAGAAGGT TCTCAAGAAA TCATCAACAT CATCACTCTT GCTATGGACA ACAAGATTCC 1200 

TTACACTTAC TTCACAAAAC AAATCTTCAC TCACCCAACC TTGGCTGAGA ACTTGAATGA 1260

CTTGTTTGCG ATTTAAGTTG AGATTTAATC GTATCGAACA GCCCTCTTTG GGCTGTTTTT 1320 

ACTTCTGCGG AATCTCAAAT CTGTCTTTCT CCTCTTTTAT GATATAATAG AAACATGAAC 1380

TTAAAAACTA CTTTGGGCCT TCTTGCTGGG CGTTCTTCCC ACTTCGTTTT AAGCCGTCTT 1440 

GGACGTGGAA GTACGCTCCC AGGGAAAGTC GCCCTTCAAT TTGATAAAGA TATTTTACAA 1500 

AACCTAGCTA AGAACTACGA GATTGTCGTT GTCACTGGAA CAAATGGAAA AACCCTGACA 1560 

ACTGCCCTCA CTGTCGGCAT TTTAAAAGAG GTTTATGGTC AAGTTCTAAC CAACCCAAGC 1620

GGTGCCAACA TGATTACAGG GATTGCAACA ACCTTCCTAA CAGCCAAATC TTCTAAAACT 1680 

GGGAAAAATA TTGCCGTCCT CGAAATTGAC GAAGCCAGTC TATCTCGTAT CTGTGACTAT 1740 

ATCCAGCCTA GTCTTTTTGT CATTACTAAT ATCTTCCGTG ACCAGATGGA CCGTTTCGGT 1800 

GAAATCTATA CTACCTATAA CATGATATTG GATGCCATTC GGAAAGTTCC AACTGCTACT 1860

GTTCTCCTTA ACGGAGACAG TCCACTTTTC TACAAGCCAA CTATTCCAAA CCCTATAGAG 1920 
TATTTTGGTT TTGACTTGGA AAAGGGACCA GCCCAACTGG CTCACTACAA TACCGAAGGG 1980

ATTCTCTGTC CTGACTGCCA AGGCATCCTC AAATATGAGC ATAATACCTA TGCAAACTTG 2040 

GGTGCCTATA TCTGTGAAGG TTGTGGATGT AAACGTCCTG ATCTCGACTA TCGTTTGACA 2100 

AAACTGGTTG AGTTGACCAA CAATCGCTCT CGCTTTGTCA TAGACGGCCA AGAATACGGT 2160 

ATCCAAATCG GCGGGCTCTA TAATATCTAT AACGCCCTAG CTGCTGTGGC CATCGCCCGT 2220 

TTCCTAGGTG CCGATTCGCA ACTCATCAAA CAGGGATTTG ACAAGAGCCG TGCTGTCTTT 2280

GGACGCCAAG AAACCTTTCA TATCGGTGAC AAGGAATGTA CCCTTGTCTT GATTAAAAAT 2340

CCAGTCGGTG CAACCCAAGC TATCGAAATG ATCAAACTAG CACCTTATCC ATTTAGCCTA 2400

TCTGTCCTCC TTAATGCCAA CTATGCAGAT GGAATTGACA CTAGCTGGAT CTGGGATGCA 2460 

GACTTTGAAC AAATCACTGA CATGGACATT CCTGAAATCA ACGCTGGCGG TGTTCGTCAT 2520

TCTGAAATCG CTCGTCGCCT CCGAGTGACT GGCTATCCAG CTGAGAAAAT CACTGAAACG 2580

AGTAATCTGG AGCAAGTTCT CAAGACCATT GAGAATCAAG ACTGCAAGCA TGCCTATATT 2640

CTGGCAACTT ATACTGCCAT GCTGGAATTT CGTGAACTGC TGGCTAGTCG TCAGATTGTT 2700 

AGAAAGGAGA TGAACTAATG GTTTATACTT CACTTTCCTC AAAAGATGGC AATTACCCCT 2760 

ATCAGCTCAA CATTGCCCAC CTCTACGGAA ATCTCATGAA TACtACGGGG ACAATGGAAA 2820

CATCCTCATG CTCAAGTATG TGGCTGAAAA ACTGGGAGCC CATGTGACCG TTGACATCGT 2880 

TTCTCTCCAT GATGACTTTG ATGAAAATCA CTACGACATC GCCTTTTTCG GTGGTGGTCA 2940 

AGACTTTGAA CAAAGTATCA TTGCAGACGA CCTACCTGCT AAAAAAGAGA GCATTGACAA 3000

CTACATCCAA AACGACGGTG TAGTTCTGGC TATCTGCGGT GGTTTCCAAC TATTGGGTCA 3060

ATATTATGTT GAAGCTTCAG GAAAACGTAT CGAAGGGCTA GGGGTCATGG GACACTACAC 3120 

GCTCAACCAG ACCAATAACC GTTTTATCGG TGACATCAAG ATTCACAATG AAGATTTCGA 3180 

TGAAACCTAC TATGGATTTG AAAATCACCA AGGTCGTACC TTCCTCTCTG ATGACCAAAA 3240

ACCGCTGGGA CAGGTTGTCT ATGGAAATGG AAACAACGAA GAAAAGGTCG GTGAAGGGGT 3300

TCATTATAAG AATGTCTTTG GTTCCTACTT CCACGGGCCT ATCCTCTCTC GTAATGCCAA 3360

TCTGGCTTAT CGCCTAGTTA CTACTGCCCT CAAGAAGAAA TATGGTCAGG ACATCCAACT 3420 

CCCTGCCTAT GAGGACATTC TCAGCCAAGA AATCGCTGAA GAGTACAGTG ACGTCAAAAG 3480 

CAAGGCTGAC TTTTCTTAAA CAAAGGAAAA TGATATCAAA GAACTCCGTT ATCTTGTCGG 3540

AGTTTTTTGT CTTTTCTTTT ACCCTTCTCC CTTGCATTTT CTCTCATTTT TTGCCAAAAT 3600

AGAGGGGTAG AAAGAAGGTA GCATATGTCT AAATTACAAC AAATCCTAAC ATATCTTGAA 3660
TCAGAAAAAC TAGACGTCGC TGTCGTATCT GACCCCGTCA CAATCAATTA CCTCACTGGT 3720

TTTTACAGTG ATCCCCATGA ACGCCAAATG TTCCTCTTTG TCCTAGCAGA TCAGGAACCT 3780

CTCCTCTTTG TCCCAGCTCT TGAAGTAGAA CGTGCAAGTA GCACCGTTTC CTTCCCAGTA 3840

GTGGGCTATG TCGATTCTGA AAATCCATGG CAAAAAATCA AACATGCTCT TCCACAACTT 3900

GACTTCAAAC GTGTCGCTGT TGAGTTTGAC AATCTCATCT TGACCAAATA CCATGGTTTG 3960 

AAAACAGTTT TTGAGACTGC TGAGTTTGAC AACCTCACTC CTCGTATCCA ACGCATGCGC 4020

CTCATCAAAT CAGCTGATGA AGTGCAAAAA ATGATGGTTG CAGGTCTTTA TGCTGACAAG 4080

GCTGTTCATG TTGGTTTTGA CAATATTTCT CTTGATAAGA CTGAGACAGA TATCATCGCA 4140

CAAATCGACT TTGCCATGAA ACGTGAAGGT TATGAAATGA GCTTTGATAC CATGGTCTTG 4200

ACTGGTGATA ATGCTGCGAA TCCACACGGC ATTCCAGCAG CTAATAAGGT TGAAAATGAT 4260

GCTCTTCTCC TCTTTGACCT GGGTGTTCTG GTCAATGGCT ATGCGTCAGA TATGACTCGT 4320

ACAGTCGCTG TCGGCAAACC AGACCAATTC AAGAAAGATA TTTACAACTT GACTCTTGAA 4380

GCCCAACAAG CTGCTCTTGA CTTTATCAAG CCAGGTGTGA CTGCTCATGA AGTGGACCGC 4440

GCTGCCCGTG AGGTCATCGA AAAAGCTGGT TATGGTGAGT ACTTCAACCA CCGTCTCGGG 4500

CATGGTATCG GTATGGATGT CCATGAATTC CCATCTATCA TGGAAGGAAA CGACATGGTC 4560 

ATCGAAGAAG GCATGTGCTT CTCTGTTGAA CCAGGTATCT ATATCCCTGG TAAAGTCGGT 4620

GTTCGTATTG AAGACTGCGG TGTTGTTACC AAGGATGGCT TCAACCTCTT TACAAGCACC 4680 

AGCAAAGATT TGCTTTATTT TGATTAAACT ATATAGCCCC TATGCTTTCC TTTCAAAATA 4740

TCTAGGGGCT ATTTTATTGT CATTTTTCTG CTATTATGCT AAAGAAATTG GCTGCAATAA 4800 

TCTAACCCTA AGTGTCTGGA ATGATAACGA GGGTGCTCTC CGCTTTTATC AAAGACAAGG 4860 

GATGAAACCC CAAGAAACAA CAATGGAAAT GATAATTGAT TAAGAAGTCA TCTATCAAAA 4920

GATGTTAGAA AAAGTTCAAT TTCACTAGAA AATGAGGAAA ATCTCCCCAC AATAAAACGC 4980 

ATAGTATCAG GTATTGTGTA CTGACCCCAA ACAGTTAGAC AATTAATTTA TCCGAAGGAT 5040

TTAGTTCTGT ACTGCACAGG ACTAAGTCCT TTTAGTTTTA CCTTAATTCG TTTGTTGTTG 5100 

TAGTAATCAA TATAGTCTAT AATGACTTGT TCCAATTGGT TAAGTGATTT AAATGTTTTC 5160 

TCATAGCCAT AAAACATTTC GGATTTTAAA ATGCCAAAGA AAGATTCCAT CATACCGTTG 5220

TCTTGGCTGT TTCCCTTGCG TGACATAGAT GCTTGAATTC CCTTATTCTC TAGGAACCGA 5280 

TGATAAGAAT CGTGTTGGTA TTGCCAGCCT TGGTCACTAT GGAGAATCGT ATTCTCGTAG 5340 

TGCTTCTCTT TGAATGCCTG TTCCAACATT GTTTGTACTT ATTCTAAATT AGGCGAACAA 5400 

GAAAGATTAA AAGCAATAAT TTCGCTGTTA AAGCCATCTA AAACTGGTGA TAAGTAAAGC 5460 
TTTTGAGTAC TTGCTGGAAT GGCAAATTCA GTCACATCTG TGTAGCACTT TTCCATTGTT 5520

TTAGAGCCTT CAAATTGGGC TTGAATGAGA TTCTCTGCCT TCTTACCAAC GTCTCCTTTA 5580 

TGAGAAGAAT ATTTTCGTTT CTTTCGCATT TTAGCTTGTA AATTGAGTAC TTTCATCAAG 5640

CCTTGAACTC TTTTATGATT TACCAGATAA CCACGATTTC TTAGTTCTAA ATGAACCCGG 5700

CGATAAGCAT AATTTCCCTT GTGTTCGATA AAGATGGATT GAATTTCAGT TTTAAGCTCT 5760

TGGTCTTTAT CTGTTTTGTC TAGCTGTTTC AAGTGATAGT AGTAGGTCCA ACGAGCTAGT 5820

TTAATGGCTT CTAGAAGAAG ATCTAACGAA AACTCAGTCA TTAATTCTTG AACAATTTCT 5880

GTCTTTCTTC TTTCTCTTTT TCCTCCTTCA ATCGGAGTTC TCTTAACTTT TTTAGGATGG 5940 

CATTCTCCGC TCTCAGGTAC TCTCCCTCTT GTTTTCTCAA CAATAGTATA CCCGTTTTTC 6000 

CTGTATTGTG CTAGCCAGTT AAGAAGTATC GTACGACTTG GGAGACCGTA TTCAAGAGAA 6060

ACTCTATCTT TAGTCCAGCC TTCATGTCAG ACTTTATTAA CCCCAATTAT TCACCCCAAA 6120

TCTAAAAACC ATCCAGAATC CTTGCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACC 6180

CAATGGGTGT TTTTTACTAG ACAAAAAAGA GTTTCCCCTT TATGGTATAA GTGTAGAAAA 6240

AAACACAAAA AGAAAGGAAA CTCACATGAA CAGTTTACCA AATCATCACT TCCAAAACAA 6300

GTCTTTTTAC CAACTATCTT TCGATGGAGG TCATTTAACC CAGTATGGTG GTCTTATCTT 6360

TTTTCAGGAA CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAAGT ATTTAGTAAC 6420 

GAATGAmCAA CGCCGCTACT GTCGTTATTC GGATTCAGAT ATCCwTGTCC AGTTCCTCTT 6480 

TCAACTGTTA ACAGGTTATG GAACGGAATA TGCTTG 6516 
(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14654 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106:

TTTTCAACCC ATATCGTGGC TCCTGAATAC TACTTACTGA CAACTATGCT ATCAGAGACT 60 

TCTCTACTTG TTTTCTATAT CATTTTCATC CATAGAAAAC AACTCATCCA CTTGGGACAT 120 

ATCTTTAGCT ATACTGTTCG ATACTCTCTC TTTTCACTTT CCTTTGTAGC AATTTATTTC 180 

CTGATTAATT TCGTGTATCC TGTAGATATG GTCATTAATT TGCCATTTTT GATTAATACT 240 

GGTTTGATTG TCTTGCTATC AGCTATCTCT TATATTAGTC TACTTGTCTT CACAAAAGAT 300
AGCATTTTCT ATGAATTTTT AAACCATGTC CTAGCCTTAA AAAATAAATT TAAAAAATCA 360

TAGGAGTTTA AAATGAAACA ACTAACCGTT GAAGATGCCA AACAAATTGA ATTAGAAATT 420 

TTGGATTATA TTGATACTCT CTGTAAAAAG CACAATATCA ACTATATTAT TAACTACGGT 480 

ACTCTGATTG GGGCGGTTCG ACATGAGGGC TTTATCCCTT GGGACGACGA TATTGATCTG 540 

TCCATGCCTA GAGAAGACTA CCAACGATTT ATTAACATTT TTCAAAAGGA AAAAAGCAAG 600 

TATAAGCTCC TATCCTTAGA AACTGATAAG AACTACTTTA ACAACTTTAT CAAGATAACC 660

GACAGTACGA CTAAAATTAT TGATACTCGA AATACAAAAA CCTATGAGTC TGGTATCTTT 720

ATCGATATTT TCCCTATAGA TCGCTTTGAT GATCCTAAGG TCATTGATAC TTGTTATAAA 780 

CTGGAAAGCT TCAAACTGCT GTCTTTCAGT AAACATAAAA ATATTGTCTA TAAGGATAGC 840

CTTTTAAAAG ATTGGATACG AACAGCCTTC TGGTTACTCC TTCGACCGGT TTCTCCTCGT 900 

TATTTTGCAA ATAAAATCGA GAAAGAAATT CAAAAATATA GTCGTGAAAA TGGGCAATAT 960

ATGGCTTTTA TCCCTTCAAA ATTTAAGGAA AAGGAAGTCT TCCCAAGTGG TACCTTTGAT 1020

AAAACAATCG ATTTACCCTT TGAGAATTTA AGCCTTCCTG CACCTGAAAA ATTTGATACT 1080

ATTTTGACAC AATTTTATGG AGATTATATG ACCCTACCAC CAGAAGAAAA ACGCTTCTAC 1140

AGTCATGAAT TTCACGCTTA TAAATTGGAG GATTAGGATG CAATATTTAG AAAAAAAAGA 1200 

AATTAAAGAA ATTCAACTAG CCCTGCTGGA CTATATTGAT GAGACTTGTA AGAAACATGA 1260

TATTCCTTAT TTTCTCAGTT ATGGAACCAT GCTTGGAGCC ATCCGCCACA AAGGTATGAT 1320

TCCTTGGGAT GATGATATTG ATATTTCCCT TTATCGTGAG GATTATGAGC GTTTACTGAA 1380 

GATTATTGAA GAAGAAAATC ACCCTCGCTA CAAGGTTCTT TCCTACGATA CATCTTCTTG 1440

GTACTTCCAT AATTTCGCAT CGATTTTGGA CACTTCTACT GTTATAGAAG ACCATGTTAA 1500 

GTACAAGCGT CATGATACCA GCCTTTTCAT CGATGTCTTC CCAATTGATC GATTTACAGA 1560

CTTGAGCATT GTCGACAAGA GCTATAAGTA TGTGGCTCTT CGTCAACTAG CTTATATCAA 1620

AAAATCACGA GCAGTTCACG GTGATAGCAA ACTAAAAGAT TTTCTTAGAT TATGTAGCTG 1680 

GTACGCTCTC CGATTTGTCA ATCCTCGCTA CTTTTACAAG AAAATTGATC AACTAGTCAA 1740

AAATGCTGTA ACCAACACTC CTCAATATGA AGGAGGAGTT GGGATCGGTA AGGAAGGGAT 1800 

GAAAGAAATC TTCCCAGTTG ATACCTTTAA AGAACTGATT TTAACTGAGT TTGAGGGCCG 1860 

TATGTTGCCT GTTCCCAAAA AATATGACCA ATTTTTAACC CAGATGTATG GCGATTATAT 1920

GACACCACCA TCAAAAGAAA TGCAAGAGTG GTATAGTCAT AGCATTAAAG CTTATCGCAA 1980 

AAACTGATTG AGGGGGATTA TACAAACTAC TAAGATAGAG GTTATTCAAA AACATAATTT 2040

TAGTAGAAAA TGAAATACAT ATTCCCACAA TAAAACGCAT CATATCAAGG TTTTTGAAAA 2100 
ACCTTGATAT GATGCGTTTT ATAATTTTAA AGACTTTTTT CTATAGTAGA TTGAAATAAG 2160

ATGCGAACAA ATCAATTAGA AAATTCAAAT TAATTTATAG AAATATTTTA GTATTCCTGT 2220 

GTACTGTTCT AAATTCAGTC TGCTATATCT TATTTTTCTA TTTAAATCGC TTCTGTAACA 2280

AAGCTACGAC TTTCAAGTAC CTTAAGCATG GCATTAGCTG TATCTAGCGC TGTGAAGAGG 2340

GGCACCCCGT GTTCAATGGC TGAACGACGA ATTTGCTCAC CATCTTCGTC AGCAGTTCGT 2400 

TTTGTTCCTA CTGTGTTAAT GATAGCTTGA ATTCTTCCTT TGCGTACAAA ACTTGGGATA 2460 

TCCTTATCGT CATCACCAAT CTTACCAACA GGTTGGGCTT GCAAGCCATG ACTAGCAAAG 2520

AAGGCTGCTG TCCCTTCTGT CGCAAGGATT CCATAACCAA TGTTTTGGAA ACGACGAGCC 2580

AAGTTCAAGG CTTCTTCTTT GGCATCATCA GCGATGGTAA AGACGACATT ACCAAAAGTT 2640

GGCAAGTGTA GATAAGAAGC TTCAAAGGCT TTATAGAGAG CTTTTTCCAA AGTAGCATCA 2700 

GAACCCATAA CTTCACCTGT TGACTTCATT TCAGGACCGA GCAAGCTGTC TACCTTAGCT 2760 

AGTTTGGTAA AGGAGAAGAC AGGTGCCTTG ATATGAACAC GGGTGCTTTC AGGGTAAAGT 2820

CCATTTTGGT AGCCAAGTTC TGATAAACTT TGACCAAGAA TGAGTTTGGT CGCTACTTGA 2880

GCCATAGGAA TATTGGTTAC CTTAGATAGG AATGGAACAG TACGGCTGGC ACGTGGATTG 2940

ACCTCAATAA CGTAGACTTT TTCATCCTTG ATAACAAACT GGATGTTCAT CATTCCAAGG 3000 

CAGTGAAGAC CGATTGCTAA GCGTTTGGTG TAGTCTGCGA TGGTCTCCTG AACCTTTTGC 3060

GACAAGGTTT GTGGTGGGTA AACAGCCATT GAGTCACCTG AGTGGACACC AGCACGTTCG 3120 

ATATGCTCCA TGATACCAGG AATGAGTACA TTTTTACCAT CTGAAATGGC ATCAACTTCG 3180 

CACTCTTGCC CAACGATATA AGAGTCGACA AGAACTGGGT GGTCTGGACT AGCCTTAACA 3240 

GCAGTTCGCA TGTAAGAACG AAGGTCTTCT TCGTTTTCAA CGATTTCCAT GGCACGTCCA 3300

CCAAGTACAT AAGATGGGCG GACAAGAACT GGGAAGCCAA TCTTGCGAGC TGCAAGAGCT 3360

GCTTCTTCTT CATTGGTAGC CGTTTGTCCT GGTGGCTGTG GAATATCCAA TTCTTTGAGA 3420 

GCTTGCTCGA AGAGGTCACG GTCTTCGGCA CGATCTAGGT CAGCAACCTG TGTACCAAGG 3480 

ATGGTCACAC CTGCTTTTGC CAATGGCTCC GCAAGGTTGA TGGCTGTTTG ACCACCGAAC 3540

TGAACGATAA CTCCCTTTGG TTGTTCCAAG TCAATGACGT TCATAACATC TTCGAATGTC 3600

AATGGCTCAA AGTAAAGCTT ATCTGATACA GAGAAGTCTG TTGAAACGGT CTCTGGGTTT 3660

GAGTTCATGA TGATAGCTTC ATAACCAGCT GCCTGGATAG CCTTAACAGA GTGAACGGTT 3720 

GCGTAGTCAA ACTCAACCCC TTGACCGATA CGGATTGGAC CTGAACCTAG GACAAGTACA 3780

GATTCTTTAT CAGATCTGAT AGATTCATTT TCCCAACCAT AGGTTGAATA GAAATATGGC 3840
GTTTCGGAGT CGAACTCTGC CGCACAAGTG TCTACCATCT TATAAACTGG AACAATCTTG 3900

TTTTCCAAGC GAAGTTGGCG AACTTTATCA TCAGTCGTTC CCCAGAGTTC AGCAATCTTA 3960

CGGTCTGAAA AACCATTAAG TTTGGCTGTT TTCAAAACTT CTAAATCTTG TGGATGAGCA 4020 

CCCAATTCTT GCTCAATTTC AAAGATATGC AAGAGTTTAT CAAGATAGAA GATATCAATT 4080

TTTGTAAGCT CTGCAATTTC TTCAGGTGTG TAGCCACGAC GAATGGCTTC TGATACGTAG 4140 

AAGAGACGGT CATCTTGGGC TTTGACAACC TTTTCAATCA AGGCATCATC AGAAACTGCT 4200 

GCAAGTTCAG GTATTTCATT GTGGTGCACC CCAATTTCAA GGGAGCGGCA GGCCTTGAGA 4260 

AGAGATTCCT CGATGTTACG ACCGATTGCC ATGACTTCTC CAGTCGCCTT CATTTGTGTA 4320 

CCGAGACGGC GTTCACCCTT TTCAAACTTG TCAAATGGGA AACGTGGAAT CTTAGCAACT 4380

ACGTAGTCAA GGGCTGGTTC AAACATGGCA TAGGTTGAAC CTGTAACTGG GTTTATAACC 4440 

TCATCCAAGG TCAAACCTAC TGCAATCTTG GCAGCCAACT TAGCAATCGG ATATCCTGTC 4500

GCTTTAGAAG CAAGGGCTGA CGAACGTGAT ACACGAGGGT TTACTTCGAT AACATAATAC 4560 

TTGAAGCTGT TAGGATCAAG AGCTAGCTGA ACATTACATC CACCTTCAAT CTTGAGGGCA 4620

CGAATAATGC TCAAGCTCGC ATCACGAAGC ATTTGGTTTT CATAGTCTGA CATGGTTTGC 4680

GCAGGGGCAA ATACAATGGA ATCCCCTGTG TGAATCCCAA CTGGGTCAAA GTTTTCCATG 4740

TTACAAACAA CCAAGGCATT GTCAGCTGAG TCACGCATCA CTTCGTATTC AATTTCCTTG 4800

AAACCGGCAA TCGAACGCTC AATCAAACAT TGGGTAACAG GTGACAATTT CAAACCATTT 4860 

TCAGTGATTT CACGCAATTC TTTCTCGTTG GCACACATAC CACCACCAGT ACCACCAAGG 4920 

GTAAAGGCTG GACGAACGAT GACTGGGTAG CCAATTGTCG CTGCAAAGGC AACTGCTTCT 4980

TCTACTGTGT TAACAATTTC AGATTCTGGA ATGGGTTGTT CAAGCTCTTC CATCAATTGT 5040 

TTAAAGAGGT CACGGTCCTC CGCTTGGTCA ATGGCAGATA ATTTGGTACC CAGAAGTTCA 5100 

ACGCCAAGCT CGTCTAGGAT ACCATTTTTA GATAATTCCA TGGCCATGTT GAGACCTGTC 5160 

TGACCACCGA GTGTTGGTAG CAAGGCATCT GGACCTTCCT TACGAAGAAT ACGTGTCACA 5220

AACTCAAGTG TAATCGGTTC AATGTAAACC TTGTCAGCAA TTTCCTTGTC CGTCATGATG 5280

GTTGCAGGAT TTGAGTTAAC CAAAACAACC TCATAACCTT CCTCTTTCAA CGACAAGCAA 5340 

GCCTGAGTCC CAGCGTAGTC AAACTCAGCA GCCTGACCAA TAATAATCGG ACCAGAACCA 5400 

ATCACCATAA TTTTTTGAAT ATCAGTACGT TTAGGCATAT ATAAGATATT AAGGGTGTCA 5460 

AGCGGACAAA GCTAAAATAG GAGTTATGAC GAAGAACTGT CAGTTCTAGG AATAACTATC 5520

TTTTTAGCAC CGTCCGTAGC CCGTATTCAG TTCAGCAAAT ACGGAGCACC CTTCTCCTTT 5580

CTATTCGTCG CCTCTCAGGG CGACATTAAA TAAGATACAA AGGACGAATA GAAAGCGATT 5640 
GAATTTTAGG AAATCAAGGA AGGATTGACA ATCCAAGTTG GTTTCTCTAC ATTCTGAGCT 5700

TTCCGTCCGT GTTCAGTTAC ATAAATTCTC CGACGAGCTT TTACTCGTTC TTAGTTTGAT 5760

TGTTTAAAAA CTTCCATCAT CTCGATAAAC TCGTCAAATA GGTAGCTAGC GTCGTGTGGC 5820

CCAGGAGCTG CATCTGGGTG GTATTGAACA GAGAAAGCAG GTTGGTATCT GTGGCGCACA 5880

CCTTCCACTG ACTTGTCATT GATTTCTTCG TGGGTAATAA TCAAGTGCTC TGGCAAATCC 5940 

TCGCGGCTGA CTGCATAACC ATGGTTCTGG CTGGTGAAGT CTACTCGTCC TGTTGCGATT 6000 

TCACGTACCG CATGGTTGAA TCCACGGTGG CCAAACTTCA TCTTATAGGT CTTAGCCCCG 6060 

TTTGCCATTG CAAAGAGTTG GTGTCCCATA CAAATACCAA AGATTGGAAT TTTTCCTTGT 6120

ACACCGCGAA TCATGTCGAG TGCTTGTGGA ACGTCTTCTG GGTTACCTGG ACCATTTGAC 6180 

AACATAACTC CGTCAGGATT GAGATGGAGA ATTTCTTCAG CCGTTGTCGA ATAAGGAACA 6240

ACTGTCACGT TACAGTTGCG TTTAGAAAGT TCACGTAGGA TTGAGTGCTT GAGACCAAAG 6300

TCCACTAGCA CCACGCTCAA ACCAACTCCT GGAGCTGGAT AAGACGTTTT AGTAGAAACC 6360 

TGTTTGATAT TGTCTGTCGG TAAAACTGTT GCTTGGAGCT GGTCCGTCAC ATGGTCCATA 6420

CTGTCCCCAA CATGGGTCAA GGTTGCACGC ATAGTACCAT GCTTACGGAT AATCTTGGTA 6480

AGAGCACGCG TATCAATTCC TGAAATCCCT GGAATTTTCT TGGCTTTCAA AAATTCATCC 6540

AAGGTCATTT GGTTGCGCCA GTTGCTAGCT CTACGCGCTT CTTCAAAAAC AACGACTCCC 6600

TTACAAGTTG GAATAATGGA TTCATAATCA TCACGATTAA TACCATAATT TCCTACCAAA 6660

GGATAAGTAA AGGTCAAGAT TTGTCCATTA TAAGACTGGT CTGTAATGGA TTCTTGGTAG 6720 

CCGGTCATCC CTGTATTAAA GACGATTTCG CCTGTTACAT CAATATCTGC TCCGAAGGCC 6780

TTGCCTTCAA AAACTGTGCC ATCTTCTAAT ACTAGAATTC TTTTTGTCAT ATTTTCACCT 6840 

CTCGTGGACG CTCACTGGCG TCTTTTAACG TCTTGTGTTT TAGTTGGCGT TTCTACTCGC 6900

TAGTACGGAT TCTAAGATTG CCATTCGAAC AAAGACACCA TTGGTCATTT GTTGGACAAT 6960 

CCGTGATTTT GGTGCTTCAA CCAAGTGGTC TGCTATTTCT ACATCACGAT TGATTGGAGC 7020

TGGGTGCATG AGGATTGCTG TTTCTTTCAA ACGATCGTAA CGTTCTTGAG TCAAGCCATG 7080 

TTGGGCATGG TAGTCTTCTT TTGAAAATAC AGCTCCACTA TCATGGCGTT CGTGTTGCAC 7140 

ACGGAGAAAC ATCATGACAT CAACCTGATC AATGATTTCA TCAATGGTTA CAAACTGTCC 7200

ATAGTCTGCA AACTCTTGAC TTCTCCATTC CTCAGGTCCA GCGAAAAAGA GTTCAGCTCC 7260

CAAGCGTTTC AAAATCTGCA TATTGGATTT GGCAACGCGT GAGTGGTCCA AGTCACCTGC 7320

AATAGCAACT TTAAGACCCT CAAAGTGGCC AAATTCCTCA TAAATGGTCA TCAAATCAAG 7380
CAAGCTCTGG CTAGGGTGTT GGCCCGAACC ATCTCCACCA TTGATGATGG AAGTCGTAAT 7440

CGTTGGACTA GCAATCAATT CTCTATAGTA GTCGACCTCT GGATGGCGAA TCACACAGAC 7500 

ATCCACTCCT AAAGCAGACA GAGTCAAAAT GGTGTCATAA AGTGTCTCAC CCTTATTAAC 7560 

CGAGCTAGTC TTCACATCAA AGTCAAGTCG TTCCAATCCA AGTTTAATCT CTGCGACTTC 7620

AAAGGACTTA TGTGTCCGTG TAGAATCCTC AAAGAAGAGA TTGGAAACAA TCGGATGGTC 7680

TTCATAGGGA AGCTGGGCTC CATTTTTAAA CTCAATTCCT CGCTTGATCA ATTTCATTAC 7740

TTGATCGACA GTGAGGTCTT CCATGGACAC CACATGGTTC AATGCTTGTT GATTTTCTGA 7800 

CATGGCTACT CCTTTAACTT TCTAAGCTTC TTCAGTAATC AGAACTCTGT CTTGGTCATC 7860 

AAGTTCTGTC ATCTCTACGA TGATTTCTTC AGAACGACTG GTTGGGATAT TTTTTCCAAC 7920

GTAATCTGGA CGGATTGGCA ATTCTCTATG TCCACGATCG ACTAGAACTG CTAAACTCAC 7980

ACGCGCAGGA CGACCATGAC CGACAATATT ATCAATAGCA GCACGGATGG TACGACCTGT 8040

ATAGAGCACA TCATCCACCA AGATAACTTC GCGGTCTGTC ACATCGACAG AAACCAAAGA 8100 

AGTATCTTCT CCACTTTTAA CATCATCACG GAAAGGTTTA GTATCCAATT CCACAACAGG 8160 

AACTGAAAGA TTTTCTAACT GCTTCAAACG TTCTTGGATT CGGTGGGCAA TAAAGACACC 8220 

ACGAGTTTTA ATACCAGCCA AGACGATCTT ATTCAAATCT TTGTTGCGTT CGATAATCTC 8280 

ATAAGTAATA CGCGTAATCG CTCGTTTGAC GGTCAATTCG TCTACAACTT CTTTTGTTTT 8340

CATGACAAAC CTCCAAAAAG AAAAGTCTCC TTAAACAAGG AGACTTGAAA TTTATAGCCA 8400 

AGCGAGCCCT ACTGCACACA GTATAGACTT CACCCTTCTA CTTTATCGCG CTCCTTGCCT 8460 

GCCTCACGGG ACAGGTTTAA AGGAATATTT AGTTATCATT TACTATAGCA CAAAGCATGC 8520

TTAAAATCAA GCAAAAAGTT TCAATGTAGC ATCTTACAAA TTGCTAAAAT CATATAATTG 8580

TGGGTACTGG TCACACTCTG GATTTTTTGG ATGGCAAATG GCTCTTCCAA AATAAATCAT 8640

GGCCTGATGG GCAGCTAACC ACTGCTCAGG CGGCAAGATA TCCATGACCC GCTTTTCCAC 8700 

CTCAAGTGGC GTCGCTGATT TTTTGACAAT ATCGTGGTGT TTGCAAATAC GCTCCACATG 8760 

AGTATCCACT GCAAAGGCTG GAATTCCAAA TCCTACACTC ATGACAACAT TGGCTGTCTT 8820 

GCGACCAACA CCTGCCAAAC TCTCCAATTC TTCACGTGTC TGAGGGACTT GACCATCAAA 8880 

ATCGTCTAGT AACTGTTGGG CACATTTTTT AAGGAATTTA GCTTTATTCC GATACAATCC 8940

CAAGCGAGAA ATATGTGAAG CAATCTCACT CTCTGTCGCT ACAGACATAG CTTGGGGTGT 9000 

TGGAAAGGCA ACAAAGAGAC CTGGTGTGGC CTTATTTACC GCTGCATCTG TCGTCTGGGC 9060 

TGATAACATG ACCGCAACCA GGAGTTCAAA ATGATTGGTA AAATCAAGAC TAGGCTTGGC 9120 

ATCTGGGAAG AGGGCAATGA TTTCTTCTAG CACCTTTCGT GCTCGTTTTT TTGACAAGAC 9180 
CATTATTCAT CTCCGTCAAA TAGTCCTTGT AAGCCAGCAA AAGGACTGTT TTCTTCTTTC 9240

TTTACTGCTT TTTGAGCTTG GTATTCTTCC TCTGTCATGA TTTGCCAGTC ATTTCCTGAG 9300 

ATAAATCCTT GACCAGCTTC TTCTTCAGCC GTCAAGACCT TGATAGGAAT GTTTAGCAGG 9360

ATATTGTCTG ATACACTCTC AGCAAGGTCA AGCTCCCCAT TTTCGATGGG CAAGACCAAG 9420

TCATCATCTA AAACTTCTTG ATCTAGCTGG TTAGTTGCGC CTTCCATGAA AACTTCCGTG 9480 

ACTGGATAAG ATTCAACTAA CTCAACTGGC TCCATACTGC GACTCGACGC AAGAACAATG 9540

GTATAAGATA GTTGATAATC TAAGAAATAC ATACGGTCTT CATATTGTAC TTTCCCAACT 9600 

GCAAGGATAT CTTTTACATC TAAAATTTCT TGATTACGTG CACGCAGGTC ATCAACTAAA 9660

TCTAACGTTT GTTCAAAGTT CAAACCTTCA GACTGCTTAC GAATTTCTTG AATATTTAAT 9720 

TTCATACTTC CTCCATAAAG ATTTACTCTC TTGATTATAC CATGAAAAGG CTACAAATCA 9780

GCACACCAAA CTTTGTAATT AAAATTCAAA ATTTTAACAT ATTTACTATG ATAGTTTTAT 9840

TTTTTAGTGC TATACTATAG GGAAAGAGTA CATCAGATCA AGGAGGATGC TCACATGGAA 9900 

GACAAGAAAC TCATTCAACT CCTATCCAAG TTAAATAAAA GCTACCAAAA CTGTAAACAG 9960

GGTACGGCAG ATGATATTCG ACTACAAGAG CTGCTAAACA CTACTATGCA AGAGCTCAAA 10020

AAAACGGAAC AGTTGAACAA CAGTATCTTA ATTGATCTTG AGAAATTTTA CCAACCTACC 10080 

AGTCTTCTGA TTGGACTGGG TAGCCTAAAA CTAAACGATC AAGCACGCAC TGCTTGGCGA 10140

AACTATGATA AATTCCATTA CGATCATGTC AAACACGTAC TAAGTCTCTA TGGACCTGTT 10200 

TTTGAATTTT AGAGCATAGA ATTTCCAGTT TTCTGTTGAC AAAATTTCCT TAAAGGTATA 10260 

ATATAAAGAT ACTAATACTC GGAGGTAAGG GAGACATGAA CAACTAAGTC TATCAAATAA 10320 

AGAACCTTTA TTTAGTAGAT CTTGTTTTTG TCTCTTTTTG TGTGCTCTTT TATGCTCTTT 10380 

TTCTGGCATG TTAATAGAGT TTTTTTGACA TAGACTTTGG GCTCTACTAG GTAAAGTAGA 10440

GCTTTTTGTT ATGCACTATG AACATTCTAG AAAGGGAAAT CATATGATAA AAATCAATCA 10500 

TCTAACCATC ACACAAAACA AAGATTTACG AGATCTTGTA TCTGACCTAA CCATGACCAT 10560 

CCAAGACGGG GAAAAGGTTG CTATTATTGG TGAAGAAGGA AATGGCAAAT CAACCTTACT 10620 

TAAAATTTTA ATGGGGGAAG CTTTGTCTGA TTTCACTATC AAGGGAAACA TCCAATCTGA 10680 

CTATCAGTCA CTGGCCTACA TTCCTCAAAA AGTCCCTGAG GACCTAAAAA AGAAAACTTT 10740 

ACACGACTAC TTCTTTTTAG ATTCTATTGA TTTAGACTAC AGTATCCTCT ATCGTTTGGC 10800 

GGAGGAATTG CATTTTGATA GCAATCGTTT CGCAAGTGAC CAAGAGATTG GCAATCTATC 10860 

AGGGGGCGAA GCTTTGAAAA TTCAGCTTAT CCATGAGTTA GCCAAACCCT TTGAGATTCT 10920 
ATTTTTAGAT GAACCTTCAA ATGACCTAGA CCTTGAGACA GTTGATTGGC TAAAAGGCCA 10980

GATTCAAAAG ACCAGGCAAA CCGTTATTTT CATTTCCCAT GATGAAGACT TTCTTTCTGA 11040 

AACGGCAGAC ACTATTGTTC ACTTGCGACT GGTCAAACAC CGTAAAGAAG CGGAAACGCT 11100 

AGTAGAGCAT TTAGACTATG ATAGCTATAG TGAGCAGAGA AAGGCTAATT TTGCCAAACA 11160 

AAGTCAGCAA GCTGCTAACA ACCAAAGAGC CTACGATAAA ACCATGGAAA AACATCGGAG 11220 

AGTTAAGCAA AATGTAGAAA CTGCGCTTCG AGCTACCAAA GATAGTACTG CCGGTCGCCT 11280

ATTGGCTAAA AAGATGAAAA CTGTCCTCTC ACAAGAAAAA CGCTACGAAA AGGCAGCTCA 11340 

GTCCATGACT CAAAAGCCAC TTGAAGAGGA ACAAATCCAA CTTTTCTTTT CAGACATCCA 11400 

ACCATTACCA GCTTCTAAAG TCTTAGTCCA ACTGGAAAAA GAAAATTTGT CCATTGACGA 11460 

CCGAGTTTTG GTTCAAAAAC TACAACTAAC TGTCCGTGGC CAAGAAAAAA TCGGTATTAT 11520

CGGGCCAAAT GGTGTTGGGA AATCAACTCT GTTAGCCAAG TTACAGAGAC TTCTGAATGA 11580

TAAAAGAGAG ATTTCACTTG GTTTTATGCC ACAAGATTAC CACAAAAAAC TGCAATTGGA 11640

TTTATCCCCA ATAGCCTATC TCAGTAAAAC TGGGGAAAAA GAGGAACTAC AGAAAATCCA 11700

ATCTCACCTA GCTAGTCTCA ATTTCAGTTA TCCAGAAATG CAGCATCAAA TTCGCTCCTT 11760

ATCTGGCGGA CAACAGGGAA AACTCCTGCT TTTGGATTTA GTCCTGCGCA AACCAAACTT 11820 

TCTCCTGCTG GATGAACCCA CACGAAACTT TTCTCCCACT TCTCAACCCC AAATCAGAAA 11880

ACTCTTTGCT ACCTATCCAG GCGGTCTCAT CACTGTTTCG CATGACCGTC GTTTCTTAAA 11940 

AGAAGTCTGC TCGATCATCT ATCGCATGAC AGAACACGGT TTGAAGCTAG TTAATTTAGA 12000 

AGATTTATAA ATTTGCAACA TAGCAAAAAT CCAGAGACGA CCTCTGGATT CTTTTACATC 12060 

TGTTTTAAAC GTTCAATCCG TTCTGAGATA GGTGGGTGGG TATAAAAGAG TTTTTGGAAC 12120

CCCCCACCTT TCTTAGGATC ATTGATATAA AGGGCACTGC TAGCATCATC GACGTGGCGA 12180 

CTCATAGGTT TGCTATTGTC CAACTTATCT AGGGCATTAA TCATTCCCTG GGGATTGCGA 12240 

GTCAGCTCGA CACTAGATGC ATCTGCCAGA AATTCCCTCT GACGAGAAAT AGCGAGCTGA 12300 

ACCAAGGTTG CAGCGAGAGG TGCCAGTACA ATAGCTAGTA GGGAAACCAC TAGCATAATG 12360 

ATTTCAAGAC CATTTCCATC TCGGTCATCA TCACTTCGTC TGCGACCTGC TCCACCCCAC 12420 

CACATCATAC GACCTGCCAT ACTAGAAAGC ATGGTGATAG CACTAGCAAG GGCAACTGCA 12480

ATAGTCGAAA TACGGATATC ATAATTACGA ATATGACTGA CTTCATGTCC CATAACAGCT 12540 

TCTAGTTCTT CACGATTCAT GATAGCTAGT AGACCTGAAG TCGCAGCAAC AGCCGCATTT 12600 

TGAGGATTAG AACCTGTCGC AAAGGCATTT AAGGCTGGAT CATCAATGAT GAAAACACGG 12660

GGCATAGGAA TCTGAGCGAC CAGAGCCATA TCTTCCACTA CATGGTAGAG GTCTGGTGCC 12720 
GTTTGCTCAT CCACCTCACG CGCTCCATTC ATGGACATGA CAATCTCTGT CGATTGAAAA 12780

ATCATAGACA AAGCGTAGAT AAAGCCGATA ATCAGTGCAA TAACCAAACC ACCAAGTCCA 12840 

GATCTTATAA AGAGATAACC AACCGCATAA CCAACAAGAG CTAAGAGTAG GAAAAATACC 12900

AGCAACAAAA TCCAGGTTTT TCGTTTATTG CTTGCAATTT GATCAAACAA CATCTTAGTC 12960 

ACCTAAACCG CTAAAATCAA CTTTAGGAAC CGACTTTTCC TCTTCAGGTG TTTGAAGGAA 13020 

ATCTGCCGCT TTAAATCCAA ACATTCCAGC GATAATATTG CTCGGGAAAG TTTCTAATTT 13080

TACATTGTAG TTGCTGACAA CACTGTTATA GAGTTGACGA GAGTAAGAAA TTTTATTTTC 13140 

TGTGTTTGTC AACTCCTCTT GCAATTTAAC AAAGTTAGCA CTAGCTTTCA AATCTGGATA 13200 

GCTTTCTGCA ACTGCAAAAA TACCTGAAAC CTGACGAGTG AGGGCATCAC TGGCTTTCAT 13260

AGCTTCTGCT GGTGAAGTCG CTGCCGCCAC TTGGTTACGT AGTTCTGCCA CCTTTTCAAG 13320

GGTAGAACCT TCATATTTGG CATAACCTTT TACAGTCTCA ATCAAGTTTG GCAAGAGGTC 13380 

ATTGCGACGT TTCAACTGAA CATCAATCTG ACTCCAAGCC TCCTTGGTTT GCATACGATT 13440 

TTTAACCAAA CCGTTATAGC TAACAATCAC AAAAATAACA ATAAGAGCGA TAACTCCAAG 13500

AATAATCCAA GTCATAATAT AAGTCCTTTC TGCTTTTAGA TTAGTACCAG TATATCAAAT 13560

TTTCTATGAT TGTGGTAAAA TAAGATGATA CTAAAGAAGG AAATAACTAT GAAACCAAAA 13620

ACATTTTACA ACTTGCTTGC CGAGCAGAAT CTTCCACTTT CGGACCAGCA AAAAGAACAA 13680

TTTGAACGTT ATTTTGAGCT CTTGGTCGAG TGGAATGAGA AGATTAATTT GACGGCGATT 13740 

ACGGACAAGG ^AGAAGTTTA TCTCAAACAT TTTTACGATT CGATTGCACC CATTCTTCAA 13800

GGTTTGATTC CCAATGAAAC TATCAAACTT CTTGATATCG GGGCTGGGGC AGGATTTCCT 13860

AGTCTACCAA TGAAAATTCT CTATCCGGAG TTAGATGTGA CCATTATTGA TTCACTCAAT 13920

AAGCGCATCA ACTTCCTACA ACTCTTGGCT CAAGAACTGG ATTTGAACGG AGTTCATTTC 13980

TACCACGGAC GTGCCGAAGA TTTTGCCCAA GACAAGAACT TCCGTGCTCA ATATGATTTT 14040

GTAACAGCTC GTGCGGTTGC CCGTATGCAG GTCCTATCTG AATTGACTAT TCCCTACCTT 14100 

AAGGTTGGTG GCAAACTATT AGCACTCAAG GCTAGCAATG CGCCTGAGGA ATTATTAGAA 14160 

GCTAAGAATG CCCTCAATCT CCTTTTTAGT AAGGTCGAAG ACAATCTCAG TACGCCCTAC 14220 

CGAATAGAGA TCCGCGCTAT ATCACAGTGG TAGAAAAGAA AAAAGAAACA CCAAATAAAT 14280 

ATCCACGTAA GGCTGGTATG CCAAATAAAC GCCCACTTTA AATTTTTTAG TAAACAAATG 14340

TTTACAAAAT CAGCCTCGCT CTTTTATTTC TAGGCTCGGG AAAAAATGAT TTACAAAATC 14400 

AGCCTCGCTC TTTTATTTCT AGGCTCGGGA AAAAATGATT TACAAAATCA TTTTTTTCTG 14460 
CTATACTATC CTAAGCAAAG GTTTTTAATG TCATCCCGTG AGGTGACGAA GACGCAGAAA 14520

TATTTAAAAC TCTTTAAAAT CTAAATTTTA AAGAAGTCTT ACTCTGAGGG CCTATTGCTG 14580

TAAAATAATG GGCTCTTTTT TGATGCCCAA AAGTGAGGTT TATATGAAAC AAGAATCAAC 14640

TGTTGATTTG TTAC 14654
(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6405 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 

AGAAAAATCT GCTTTACAGA AAATAAAAAT AATAGGAGAA AATCTATGTC AGATTTGAAA 60 

AAATACGAAG GTGTCATTCC AGCCTTCTAC GCATGTTATG ATGATCAAGG AGAAGTAAGC 120 

CCAGAACGTA CGCGTGCCTT GGTTCAATAC TTCATTGATA AAGGTGTTCA AGGTCTTTAT 180 

GTCAATGGTT CTTCTGGTGA ATGTATCTAC CAAAGCGTTG AAGATCGCAA GTTGATTTTG 240 

GAAGAAGTCA TGGCGGTAGC AAAGGTAAAT TGACCATTAT TGCCCATGTT GCTTGCAATA 300 

ATACTAAAGA TAGTATGGAA CTTGCTCGCC ATGCTGAAAG CTTGGGAGTA GATGCTATTG 360 

CAACGATTCC ACCAATTTAT TTCCGCTTGC CAGAATACTC AGTTGCCAAA TACTGGAACG 420 

ATATCAGTTC TGCAGCTCCA AACACAGACT ACGTGATTTA CAACATTCCT CAATTGGCAG 480 

GGGTTGCTTT GACTCCAAGC CTTTACACAG AAATGTTGAA AAATCCTCGT GTTATCGGTG 540 

TGAAGAACTC TTCTATGCCA GTTCAAGATA TCCAAACCTT TGTCAGCCTT GGTGGAGAAG 600 

ACCATATCGT CTTTAATGGT CCTGATGAGC AGTTCCTAGG AGGACGCCTC ATGGGGGCTA 660

GGGCTGGTAT CGGTGGTACT TATGGTGCTA TGCCAGAACT CTTCTTGAAA CTCAATCAGT 720 

TGATTGCGGA TAAGGACCTA GAAACAGCGC GTGAATTGCA GTATGCTATC AACGCAATCA 780

TTGGTAAACT CACTTCTGCT CATGGAAATA TGTACGGTGT CATCAAAGAA GTCTTGAAAA 840 

TCAATGAAGG CTTGAATATT GGATCTGTTC GTTCACCATT GACACCAGTG ACTGAAGAAG 900 

ATCGTCCAGT TGTAGAAGCG GCTGCTGCCT TGATTCGTGA AACCAAGGAG CGCTTCCTCT 960 

AATCTAAAAG GAGGTATTTA TGACATATTA CGTTGCAATT GATATCGGTG GAACCAACAT 1020 

CAAGTATGGT TTGGTTGATC AAGAGGGGCA ACTTCTTGAA TCGCATGAAA TGCCAACTGA 1080 

GGCGCATAAG GGTGGACCTC ATATCTTACA AAAGACCAAA GATATCGTAG CTAGTTATTT 1140 

AGAAAAAGGC CCAGTAGCAG GTGTTGCCAT ATCTTCTGCT GGGATGGTGG ATCCGGATAA 1200
GGGTGAGATT TTCTATGCTG GGCCGCAAAT CCCTAACTAC GCAGGCACCC AGTTCAAAAA 1260

GGAAATCGAA GAAAGCTTTA CTATTCCTTG TGAGATTGAA AATGATGTCA ACTGTGCAGG 1320

TCTTGCTGAG GCAGTATCTG GTTCAGGCAA GGGAGCAAGT GTGACACTTT GCTTGACCAT 1380

TGGAACCGGT ATCGGTGGTT GCTTGATTAT GGATAGGAAA GTCTTCCATG GTTTTAGCAA 1440

TTCAGCCTGT GAAGTCGGGT ATATGCATAT GCAGGATGGA GCTTTTCAAG ACTTGGCTTC 1500

TACAACAGCT TTAGTGAAAT ATGTAGCTGA AGCCCATGGA GAAGATGTTG ATCAGTGGAA 1560

TGGCCGTAGA ATTTTCAAAG AAGCCACTGA AGGAAACAAA ATCTGCATGG AAGGTATTGA 1620

CCGTATGGTT GACTATCTAG GAAAAGGTCT GGCAAATATT TGCTACGTTG CCAATCCAGA 1680

AGTGGTTATT CTTGGTGGTG GTATCATGGG GCAAGAGGCT ATCCTCAAAC CTAAGATCCG 1740

TACAGCCTTG AAAGAGGCTT TGGTACCAAG TTTAGCAGAA AAAACACGAT TAGAATTTGC 1800

CCATCACCAA AATACAGCAG GGATGTTGGG TGCATATTAT CATTTTAAGA CAAAACAATC 1860

CTAGTTTGGC TCAGCCAAAC TAGGATTTTC TTACACGTTT TTGTCTACGA TAGCCGTTGA 1920

GTTTTTTATT TTCCCAGTAG CTATTAAAGA TTTTTTCCTT GCTTTCGCGA TTGATTTCCA 1980

AAAAGTAGGC ATAAATCAAA TCGATAAAGA AGAGCATAGG AAGTTGAGCG GATATTCGTT 2040

GGATATAGGA GGGTTGGCTG TGGGTGGCTA CAAGAACAGT CTCTGTATAG GTCTGGCTAT 2100

CTTTATTGGG AACACTTGTA AAGAGTACAG TCTTTGCCCC CATCTCCTTA GCATCTAATA 2160

GACTATCTAA AATAGAAGGA GTTGAGCCTG AAAGTGAGAA GCCCAGTACT AGACAATTTT 2220

CATCCATGAT GCTGGTTGTC CAGGCAAAGC CGTCTTGGTC TGTCAAAGCT TCGCAGACCA 2280

CACCTAGTCG CATAAAACGT AATTTCATTT CACGGGCGAC GAGGCCAGAA CTCCCTGTTC 2340

CAAAGAAGTA GATACGCTCA GCATCTTCGA TTAGCTGGGC AATTCGTTCT AGTTGGATTT 2400

CGTCAATCAA GTCTTGTGTT TGTTCCCTCA TATTGCTATA ACTTCTGAGG ACTCGTTTGG 2460

TCAGTGGACT GTGCTTGGAG ACTTGGTTGG CTTGATTTTC TGCCTGATGT TGGTATTGGA 2520

AAATAAATTC TCGGTAGCCA GTAAAGCCAC ACTTTTTAGC AAAGCGGGTC AAAGCAGCTT 2580

GAGAAATATG TAATTTTTGG GTGACTTGTT GAGAAGATAA ATCATCTGTA ATCGTTTCAG 2640

CTTGCAAAAA ATAGCGAGCG ATTTCTTGTT CTAGGTCTGT CATTTCTTCA AAATGTGAAT 2700

CAATGATAGT TGCGATATCT GGTTTGTCCA TAGGGAAAGC TCCTTTACAT GAGTCATACT 2760

GGAAGACTAG ATCAGAGAAT AGTCACACTT CATTATAACA CATAATATAA GGATAGATAA 2820

ATAAAAACGC ATCTCTGTTT TAAAAACGAA AAAATCGAAA AAGCTTCTCT CTTTTCCATA 2880

ATTTTCTACT CAAATTGTGG TACAATTAAG AGTAAGATTT TAAGTTAGAA ATGAGACTGA 2940

TTTGTATGAG AAAATTTAAC AGCCATTCGA TTCCGATTCG GCTTAATTTA TTGTTTTCAA 3000

TCGTCATTTT ACTCTTTATG ACCATTATTG GTCGTTTGTT GTATATGCAG GTTTTGAACA 3060 

AGGATTTTTA CGAAAAAAAG CTAGCTTCAG CTAGTCAGAC CAAGATTACA AGCAGTTCAG 3120

CCCGTGGGGA AATTTATGAT GCTAGTGGAA AACCTTTGGT AGAAAATACG TTAAAGCAGG 3180

TTGTTTCCTT TACGCGTAGC AATAAAATGA CGGCTACAGA CTTAAAAGAA ACAGCTAAAA 3240 

AGTTACTGAC TTATGTGAGC ATCAGTTCTC CAAATTTGAC AGAACGCCAG CTGGCGGATT 3300

ACTATTTGGC TGATCCTGAA ATCTATAAAA AAATAGTGGA AGCTCTCCCA AGTGAGAAAC 3360

GCTTGGATTC AGATGGCAAT CGTCTATCCG AATCAGAACT GTATAACAAT GCGGTCGATA 3420 

GTGTACAAAC GAGTCAACTA AACTATACAG AGGATGAAAA GAAAGAAATC TATCTTTTTA 3480

GTCAGTTAAA TGCTGTTGGA AACTTTGCGA CAGGAACCAT TGCGACAGAT CCTCTAAATG 3540

ATTCTCAGGT GGCTGTTATT GCCTCTATTT CAAAGGAGAT GCCTGGCATT AGTATTTCTA 3600 

CTTCTTGGGA TAGAAAGGTT TTGGAAACTT CCCTTTCTTC TATAGTTGGG AGTGTATCCA 3660

GTGAAAAAGC TGGTCTCCCA GCGGAAGAAG CAGAAGCCTA TCTTAAAAAA GGCTATTCTC 3720

TAAATGACCG TGTAGGAACC TCCTATTTGG AAAAGCAATA TGAAGAGACC TTACAAGGAA 3780 

AACGCTCGGT AAAAGAAATC CATCTGGATA AATATGGCAA TATGGAAAGC GTGGATACAA 3840

TTGAGGAAGG TAGTAAGGGA AACAATATCA AACTGACCAT TGATTTGGCT TTCCAAGATA 3900

GCGTGGATGC TTTACTGAAA AGTTATTTCA ATTCTGAGCT AGAAAATGGT GGAGCCAACT 3960 

ATTCTGAAGG TGTCTATGCA GTCGCCCTTA ACCCAAAAAC AGGTGCGGTT TTGTCTATGT 4020 

CAGGGATTAA ACATGACTTG AAAACGGGAG AGTTGACGCC TGATTCCTTG GGAACGGTAA 4080 

CCAATGTCTT TGTTCCAGGT TCGGTTGTCA AGGCGGCGAC CATCAGCTCA GGTTGGGAAA 4140 

ATGGAGTCTT GTCAGGAAAC CAGACCTTGA CAGACCAGTC CATTGTCTTC CAAGGTTCAG 4200 

CTCCCATCAA TTCTTGGTAT ACTCAGGCTT ACGGTTCATT CCCTATCACA GCGGTCCAAG 4260

CTCTGGAGTA TTCATCAAAT ACCTATATGG TCCAAACAGC CTTAGGTCTT ATGGGGCAAA 4320

CCTATCAACC CAATATGTTT GTCGGCACCA GCAATCTAGA GTCTGCTATG GAGAAACTGC 4380 

GTTCAACCTT TGGCGAATAT GGCTTGGGTA CTGCGACAGG AATTGACCTA CCAGATGAAT 4440 

CTACTGGATT TGTTCCCAAA GAGTATAGCT TTGCTAATTA CATTACTAAT GCCTTTGGGC 4500 

AGTTTGATAA CTATACGCCG ATGCAGTTGG CTCAGTATGT AGCAACTATT GCAAATAATG 4560 

GTGTTCGTGT GGCTCCTCGT ATTGTTGAAG GCATTTATGG TAATAATGAT AAGGGAGGAC 4620 

TGGGTGACTT GATTCAGCAA CTGCAACCGA CAGAGATGAA TAAGGTCAAT ATATCCGACT 4680 

CCGATATGAG CATCTTGCAC CAAGGTTTTT ATCAGGTTGC CCATGGTACT AGTGGATTGA 4740 
CAACTGGACG TGCCTTTTCA AATGGTGCCT TGGTATCCAT TAGCGGAAAA ACAGGTACAG 4800

CCGAAAGCTA TGTGGCAGAT GGTCAGCAAG CAACCAATAC CAATGCGGTG GCCTATGCCC 4860

CATCTGATAA TCCCCAAATC GCTGTCGCAG TGGTCTTTCC TCATAATACC AATCTAACAA 4920

ATGGTGTAGG ACCTTCCATT GCGCGTGACA TTATCAATCT GTATCAAAAA TACCATCCAA 4980 

TGAATTAGAA AGGAAATTAT GCTTTATCCA ACACCTATTG CCAAGTTGAT TGACAGTTAT 5040

TCTAAGTTAC CAGGTATCGG GATTAAGACG GCTACGCGTC TGGCCTTTTA TACGATTGGG 5100 

ATGTCTGCTG ATGATGTCAA TGAATTTGCA AAAAATCTCC TTTCTGCTAA GAGAGAATTG 5160 

ACATATTGTT CTATTTGTGG ACGTTTGACA GACGACGATC CTTGTTCTAT CTGTACTGAT 5220

CCGACTCGTG ACCAGACAAC AATTTTAGTT CTTGAGGATA GTAGAGATGT GGCAGCCATG 5280

GAAAATATCC AAGAATACCA TGGACTCTAT CATGTCCTTC ATGGCCTCAT TTCTCCTATG 5340

AATGGTATCA GTCCGGACGA TATCAATCTC AAGAGCCTTA TGACTCGTCT TATGGATAGT 5400

GAGGTTTCAG AAGTGATTGT GGCGACTAAT GCTACAGCGG ATGGTGAAGC GACTTCCATG 5460

TATCTTTCAC GTTTGCTCAA GCCGGCTGGT ATCAAGGTTA CGCGTCTAGC ACGAGGTCTC 5520

GCTGTGGGAG CGGACATTGA GTATGCGGAC GAAGTGACAC TCTTACGAGC CATTGAAAAT 5580

CGGACAGAGT TGTAAGTGTA GGCAAATTTA CGAACTCCAT TCATTTATAA AAAATCAAAG 5640

AGGCTGAAAA TCGTTCCTAT CGGCCTCTTT TTGTATAGTG TGATGAGTAG GCTCAGGTTC 5700 

AAGTTTTAAA AAACCAAGCA AATATGATAT ACTAAAGAGC GAGTATTCTA GTAGAATTAG 5760

GACAAATAAT ATGAAACAAA CGATTATTCT TTTATATCGT GGACGGAGTG CGGAACGCGA 5820

AGTCTCTGTC CTTTCAGCTG AGAGTGTCAT GCGTGCGGTC GATTACGACC GTTTCACAGT 5880 

CAAGACTTTC TTTATCAGTC AGTCAGGTGA CTTTATCAAA ACACAGGAAT TTAGTCATGC 5940 

TCCGGGGCAA GAAGACCGTC TCATGACCAA TGAAACCATT GATTGGGATA AGAAAGTTGC 6000 

ACCAAGTGCT ATCTACGAAG AAGGTGCAGT GGTCTTTCCA GTCCTTCACG GGCCAATGGG 6060

AGAAGATGGC TCTGTTCAAG GATTCTTGGA AGTTTTGAAA ATGCCTTACG TTGGTTGCAA 6120

CATTTTGTCA TCAAGTCTTG CCATGGATAA AATCACGACT AAGCGTGTTC TGGAATCTGC 6180 

TGGTATTGCC CAAGTTCCTT ATGTGGCTAT CGTTGAAGGC GATGATGTGA CTGCTAAAAT 6240

CGCTGAAGTG GAAGAAAAAT TGGCTTATCC AGTCTTCACT AAGCCGTCAA ACATGGGGTC 6300 

TAGTGTCGGT ATTTCTAAGT CTGAAAACCA AGAAGAACTC CGTCAAGCCT TAAAACTTGC 6360 

CTTCCGATAT GACAGCCGTG TCTTGGTTGA GCAAGGAGTG AATGC 6405 
(2) INFORMATION FOR SEQ ID NO: 108: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:

CGAGCTCGGG TACCGGGATT TTAAGGAGTT TGATATGTAT AACCTATTAT TAACCATTTT 60 

ATTAGTATTA TCTGTTGTGA TTGTGATTGC AATTTTCATG CAACCAACCA AAAACCAATC 120

CAGCAATGTA TTTGATGCCA GTTCAGGTGA TTTGTTTGAA CGCAGTAAAG CTCGCGGTTT 180

TGAAGCTGTA ATGCAGCGTT TGACAGGGAT TTTAGTCTTT TTCTGGCTAG CCATTGCCTT 240 

AGCATTGACG GTATTATCAA GTAGATAAGA AAATAATGGG CAGGACTAGG TCTTTGCCTC 300 

TTTTTATTTT TAAAGGATGT TTGAGAAGGT TTTACAGTAA AAGAAAATTA AAAAATCTAG 360 

AAAGAAAATA TGAAAGATAG AATAAAAGAA TATTTACAAG ACAAGGGAAA GGTGACTGTT 420

AATGATTTGG CTCAGGCTTT GGGAAAAGAC AGTTCCAAGG ATTTTCGTGA GTTGATTAAA 480

ACCTTGTCCT TAATGGAAAG AAAGCACCAA ATTCGTTTTG AAGAAGATGG TAGTCTGACA 540 

TTAGAAATTA AGAAAAAACA TGAGATTACC CTCAAGGGGA TTTTTCATGC CCATAAAAAT 600 

GGCTTTGGCT TTGTTAGTCT GGAAGGCGAG GAGGACGACC TTTTTGTAGG GAAAAATGAT 660 

GTCAACTATG CTATTGATGG TGATACCGTC GAGGTAGTGA TTAAGAAAGT CGCTGACCGC 720

AATAAGGGAA CAGCAGCAGA AGCCAAAATT ATTGATATCC TAGAACACAG TTTGACAACA 780 

GTTGTCGGGC AAATCGTTCT GGATCAGGAA AAACCTAAGT ATGCTGGCTA TATTCGTTCA 840

AAAAATCAGA AAATCAGTCA ACCGATTTAT GTTAAGAAAC CAGCCCTAAA ATTAGAAGGA 900

ACAGAAGTTC TCAAAGTCTT TATCGATAAA TACCCAAGCA AGAAACATGA TTTCTTTGTC 960

GCGAGTGTTC TCGATGTAGT GGGACACTCA ACGGATGTCG GAATTGATGT TCTTGAGGTC 1020

TTGGAATCAA TGGACATTGT ATCCGAGTTT CCAGAAGCTG TTGTTAAGGA AGCAGAAAGT 1080 

GTGCCTGATG CTCCGTCTCA AAAGGATATG GAAGGTCGTC TGGATCTAAG AGATGAAATT 1140 

ACCTTTACCA TTGACGGTGC GGATGCCAAG GACTTGGACG ATGCAGTGCA TATCAAGGCT 1200 

CTGAAAAATG GCAATCTGGA GTTTGGGGTT CACATCGCAG ATGTTTCTTA TTATGTGACC 1260 

GAGGGGTCTG CCCTTGACAA GGAAGCCCTT AACCGTGCGA CTTCTGTTTA CGTGACAGAC 1320

CGAGTGGTGC CAATGCTTCC AGAACGACTA TCAAATGGCA TCTGCTCTCT CAATCCCCAA 1380 

GTTGACCGCC TGACCCAGTC TGCTATTATG GAGATTGATA AACATGGTCG TGTGGTCAAC 1440 

TATACCATTA CACAAACAGT TATCAAGACC AGTTTTCGTA TGACCTATAG CGATGTCAAT 1500 
GATATCCTAG CTGGCGATGA AGAAAAGAGA AAAGAATATC ATAAAATTGT ATCAAGTATC 1560

GAACTCATGG CCAAGCTTCA TGAAACTTTA GAAAACATGC GTGTGAAACG TGGAGCTCTC 1620

AATTTTGATA CCAATGAAGC GAAGATTTTA GTGGATAAAC AAGGTAAGCC TGTTGATATC 1680 

GTTCTTCGGC AGCGTGGTAT TGCCGAGCGG ATGATTGAGT CTTTTATGTT GATGGCTAAT 1740 

GAAACAGTTG CCGAACATTT CAGCAAGTTG GATTTGCCTT TTATCTATCG AATTCACGAG 1800

GAGCCTAAGG CTGAAAAGGT TCAGAAGTTT ATTGATTATG CTTCGAGTTT TGGCTTGCGC 1860

ATTTATGGAA CTGCCAGTGA GATTAGTCAG GAGGCACTTC AAGACATCAT GCGTGCTGTT 1920

GAGGGAGAAC CTTATGCAGA TGTATTGTCC ATGATGCTTC TTCGCTCTAT GCAGCAGGCT 1980

CGTTATTCGG AGCACAATCA CGGCCACTAT GGACTAGCTG CTGACTATTA TACTCACTTT 2040

ACCAGTCCAA TTCGTCGTTA TCCAGACCTT CTTGTTCACC GTATGATTCG GGATTACGGC 2100 

CGTTCTAAGG AAATAGCAGA GCATTTTGAA CAAGTGATTC CAGAGATTGC GACCCAGTCT 2160 

TCCAACCGTG AACGTCGTGC CATAGAAGCT GAGCGTGAAG TCGAAGCCAT GAAAAAGGCT 2220

GAGTATATGG AAGAATACGT GGGTGAAGAG TATGATGCAG TTGTATCAAG TATTGTCAAA 2280

TTCGGTCTCT TTGTCGAATT GCCAAACACA GTTGAAGGCT TGATTCACAT CACTAATCTG 2340

CCTGAATTTT ATCATTTCAA TGAGCGTGAT TTGACTCTTC GTGGAGAAAA ATCAGGTATC 2400

ACTTTCCGAG TGGGTCAGCA GATCCGTATC CGTGTTGAAA GAGCGGATAA AATGACTGGA 2460

GAGATTGATT TTTCATTCGT ACCTAGTGAG TTTGATGTGA TTGAAAAAGG CTTGAAACAG 2520

TCTAGTCGTA GTGGCAGAGG GCGTGATTCA AATCGTCGTT CGGATAAGAA GGAAGACAAG 2580 

AGAAAATCAG GACGCTCAAA TGATAAGCGT AAGCATTCAC AAAAAGACAA GAAGAAAAAA 2640 

GGAAAGAAAC CTTTTTACAA GGAAGTAGCT AAGAAAGGAG CCAAGCATGG CAAAGGGCGA 2700

GGGAAAGGTC GTCGCACAAA ATAAAAAGGC ACGCCACGAC TATACAATCG TAGATACGCT 2760

AGAGGCAGGG ATGGTCCTGA CTGGAACTGA AATCAAGAGT GTACGAGCTG CTCGAATTAA 2820

TCTCAAGGAT GGCTTTGCTC AAGTGAAAAA TGGAGAAGTT TGGCTGAGCA ATGTTCATAT 2880 

CGCGCCTTAC GAAGAGGGCA ATATCTGGAA CCAGGAACCA GAACGTCGTC GTAAACTCCT 2940

GCTCCATAAA AAGCAAATTC AAAAATTGGA ACAAGAGATC AAAGGGACAG GAATGACCTT 3000 

AGTTCCCCTT AAGGTCTATA TAAAAGATGG CTACGCTAAG CTTCTTTTAG GACTTGCCAA 3060 

AGGGAAGCAT GACTATGACA AACGGGAGTC TATCAAACGT CGTGAGCAAA ATCGAGATAT 3120 

CGCGCGTGTG ATGAAAGCTG TTAATCAGCG ATAAAAAGAG GAATTGAAAA TGGAAAAATT 3180 

AGTTGCCTAT AAACGCATGC CTTTGTGGAA TAAACAAACA ATGCCTGAAG CTGTTCAGCA 3240

AAAGCACAAT ACAAAAGTTG GGACTTGGGG GAAAATTACT GTCTTGAAGG GAGCTCTCAA 3300

GTTTATTGAA TTGACAGAAG AAGGGGAAGT TCTAGCTGAA CACCTCTTTG AAGCAGGGGC 3360

AGACAATCCA ATGGCCCAAC CTCAAGCCTG GCACCGAGTG GAAGCTGCCA CAGATGATGT 3420 

GGAATGGTAC TTGGAATTTT ATTGTAAACC TGAGGATTAT TTTGCTAAAA AATACAATAC 3480

CAATCCTGTT CATTCAGAGG TCCTAGAGGC CATGCAGACA GTGAAACAAG GGAAAGCTTT 3540 

GGATTTGGGT TGTGGTCAGG GGCGTAATTC TCTTTTTCTA GCCCAGCAAG ATTTTGATGT 3600

GACGGCTGTA GATCAAAATG GACTAGCTCT TGAAATCTTG CAAAGCATTG TGGAGCAGGA 3660

AGATTTGGAC ATGCCTGTTG GCCTTTACGA TATCAATTCA GCTAGCATTG AACAAGAATA 3720

TGATTTTATC GTTTCAACAG TTGTTCTCAT GTTTCTACAA GCGGACCGCA TTCCAGCTAT 3780 

TATTCAAAAT ATGCAGGAGA AAACCAGTGT TGGTGGTTAC AACCTTATCG TTTGTGCCAT 3840 

GGACACGGAG GATTATCCTT GCTCGGTTAA CTTCCCATTC ACCTTTAAAG AAGGAGAACT 3900

GGCAGACTAT TACAAGGATT GGGAATTGGT TAAGTACAAT GAAAATCCAG GCCATTTGCA 3960 

CCGTCGCGAT GAGAATGGCA ATCGTATTCA ACTACGCTTT GCGACCTTAC TAGCTAAGAA 4020

AATCAAGTAA ACACACATGA AGATTAGGAA TTTTCCTGAT CTTTTTTCTT TTTTACGAAT 4080

GATATAGAAA AGGAGGGAAT TCATGTTTGT TGCGAGAGAT GCTAGGGGAG AATTGGTAAA 4140

TGTGTTAGAG GATAAACTTG AGAAGCAAGC ATACACCTGC CCAGCTTGTG GAGGCCAGCT 4200

CCATTTGCGT CAAGGACCAA GTGTACGGAC GCATTTTGCC CATAAATCCT TAAAAGACTG 4260 

TGATTTTTTC TTTGAAAATG AAAGTCCAGA ACACCTGGCC AATAAGGAAT CCCTCTATCA 4320 

CTGGTTGAAA AAAGAGACAA AGGTTCAATT AGAGTACCCG CTTTCAGAAC TTAAACAGAT 4380 

TGCGGATGTA TTTGTAAATG GCAATCTAGC TCTAGAAGTT CAGTGTAGTC CCTTGCCTCA 4440

GAAAGTCCTT AAAGAGCGAA GTGAGGGCTA TCGTAGTCAG GGTTACCAAG TACTGTGGTT 4500 

GCTGGGTCAA AAACTGTGGC TCAAGGAGCG TTTGACTCGT CTACAGCAAG GTTTTCTTTA 4560 

TTTCAGTCAA AACATGGGCT TTTATGTTTG GGAATTAGAC AAGGAAAAAC AAGTTTTAAG 4620

ACTCAAATAC CTGATTTACC AGGATCTCCG CGGTAAACTC CATTATCAAA TCAAGGAATT 4680 

TTCCTATGGT CAAGGTAGTT TATTGGAAAT ATTGCGTCTT CCCTATAAGA GACAAAAAAT 4740 

ATCTCATTTT ACAGTTTCTG AGGACAAGGA CATCTGTCGC TATATCCGGC AACAACTTTA 4800 

TTATCAAAAT CTCTTTTGGA TGAAAGAACA AGCAGAAGCC TATCAAAAGG GAGAAAATAT 4860 

CCTGACTTAT GGACTGAAAG AATGGTATCC ACAAATTCGA CCAATAGTGG GCAAATTTTT 4920

CCAGATTGAA CAAGACTTGA CTAGCTATTA TCAGCACTTT TATACCTATT ACCAAAAAAA 4980 

TCCTCAAAAT GATTGGCAAA AGCTTTATCC ACCAGCCTTT TATCAGCAAT ATTTCTTGAA 5040 
AAATATGGTA GAATAGAAAG GATGGAGGAA TCTAATGGTA TTACAAAGAA ATGAAATAAA 5100

TGAAAAAGAT ACATGGGATC TATCAACGAT CTACCCAACT GACCAGGCTT GGGAAGAAGC 5160 

CTTAAAAGAT TTAACAGAAC AATTGGAGAC AGTAGCCCAG TATGAAGGCC ATCTCTTGGA 5220

TAGTGCGGAT AACCTACTAG AAATCACTGA ATTTTCTCTT GAAATGGAAC GCCAGATAGA 5280

GAAGCTTTAC GCTTATGCTC ATATGAAGAA TGACCAGGAT ACACGTGAAG CTAAGTATCA 5340 

AGAGTACTAT GCCAAGGCCA TGACACTCTA CAGCCAGTTA GACCAAGCCT TTTCATTCTA 5400 

TGAGCCTGAA TTTATGGAGA TTAGCGAAAA GCAGTATGCT GACTTTTTAG AAGCTCAACC 5460 

AAAGCTGCAG GTTTATCAAC ACTATTTTGA CAAGCTTTTG CAAGGCAAGG ATCACGTTCT 5520 

TTCACAACGT GAAGAAGAAT TATTGGCTGG AGCTGGAGAA ATCTTTGGTT CAGCAAGTGA 5580

AACCTTCGCT ATCTTGGACA ATGCGGATAT TGTGTTCCCT TATGTCCTAG ACGATGATGG 5640

TAAAGAAGTT CAGCTATCTC ATGGGACTTA CACACGTTTG ATGGAGTCTA AAAAACGTGA 5700

GGTTCGCCGT GGTGCCTATC AAGCTCTTTA TGCGACTTAC GAACAATTCC AACACACCTA 5760

TGCCAAAACC TTGCAAACCA ATGTTAAGGT GCAAAATTAC CGTGCTAAAG TTCGTAACTA 5820

CAAGAGTGCT CGTCATGCAG CCCTCGCAGC GAATTTTGTT CCAGAAAGTG TTTATGACAA 5880

TTTGGTAGCA GCAGTTCGCA AGCATTTGCC ACTCTTACAT CGCTATCTTG AGCTTCGTTC 5940

AAAAATCTTG GGGATTTCAG ATCTCAAGAT GTACGATGTC TACACACCGC TTTCATCTGT 6000 

TGAATACAGT TTTACCTACC AAGAAGCCTT GAAAAAAGCA GAAGATGCTT TGGCAGTCTT 6060 

GGGTGAGGAT TACTTGAGCC GTGTTAAACG TGCCTTCAGC GAGCGTTGGA TTGATGTTTA 6120

CGAAAATCAA GGCAAGCGTT CAGGTGCCTA CTCTGGTGGT TCTTATGATA CCAATGCCTT 6180 

TATGCTTCTC AACTGGCAAG ACAATCTGGA CAATCTCTTT ACTCTTGTTC ATGAAACAGG 6240

TCACAGTATG CATTCAAGCT ATACTCGTGA AACTCAGCCT TATGTTTACG GGGATTACTC 6300

TATCTTTTTG GCTGAGATTG CCTCAACTAC CAATGAAAAT ATCTTGACGG AGAAATTATT 6360

GGAAGAAGTG GAAGACGACG CAACACGCTT TGCTATTCTC AATAACTTCC TAGATGGTTT 6420

CCGTGGAACA GTTTTCCGCC AAACTCAATT TGCTGAGTTT GAACACGCCA TTCACCAAGC 6480

AGATCAAAAT GGGGAGGTCT TGACAAGCGA TTTCCTAAAT AAACTCTACG CAGACTTGAA 6540

CCAAGAGTAT TATGGTTTGA GTAAGGAAGA CAATCCTGAA ATCCAATACG AGTGGGCTCG 6600

CATTCCACAC TTCTACTATA ACTACTATGT ATATCAATAT TCAACTGGCT TTGCGGCCGC 6660 

CTCAGCCTTG GCTGAAAAAA TTGTCCATGG TAGTCAAGAA GACCGTGACC GCTATATCGA 6720

CTACCTCAAG GCAGGTAAGT CGGACTATCC ACTTAATGTC ATGAGAAAAG CTGGTGTTGA 6780

TATGGAGAAG GAAGACTACC TCAACGATGC CTTTGCAGTC TTTGAACGCC GTTTAAATGA 6840

GTTTGAAGCC CTTGTTGAAA AATTAGGATT GGCATAAAAT GGTTGAATCG TATAGTAAGA 6900 

ATGCTAACCA TAACATGCGT CGTCCTGTCG TCAAAGAAGA AATTGTAGAC TTGATGCGTC 6960 

AGCGTCAAAA GCAGGTCACA GGTTTCTTGA AAGAATTGGA AGACTTTGCC CGCAAGGAAA 7020

ATATTCCTAT TATTCCCCAT GAAACGGTTG CTTATTTCCG TTTTCTTATG GAAACCATGC 7080 

AGCCTAAAAA TATTCTGGAA ATTGGGACGG CTATCGGTTT TTCAGCTCTC TTGATGGCTG 7140

AACATGCGCC AAATGCTAAG ATTACAACTA TTGATCGTAA TCCAGAAATG ATTGGTTTTG 7200 

CCAAGGAAAA TTTTGCCCAG TTTGACAGTC GCAAGCAAAT CACTCTCCTA GAGGGAGATG 7260 

CGGTGGATGT CTTATCTACA CTGACAGAGT CTTATGATTT CGTCTTTATG GATTCTGCCA 7320 

AGTCTAAATA CATCGTCTTT CTGCCAGAAA TCCTCAAACA TTTGGAAGTT GGTGGTGTGG 7380 

TTGTCTTGGA TGATATTTTT CAAGGTGGTG ATGTTGCCAA GGATATTATG GAAGTCCGTC 7440 

GTGGTCAGCG AACCATTTAT CGAGGCCTTC AAAAATTATT TGATGCAACC TTAGACAATC 7500 

CAGAACTCAC CGCAACATTA GTGCCTTTAG GAGATGGTAT TCTCATGCTT CGTAAAAATG 7560 

TAGCAGATGT TCAACTGTCT GAAAGCGAAT GATTTTCAGA AAAATTTAAG AAAAAATAGT 7620 

AAAATAGATA GAGTAACACT TATCTCAAAG GAGTAGACAT GAAGAAAAAA TTATTGGCAG 7680

GTGCCATCAC ACTATTATCA GTAGCAACTT TAGCAGCTTG TTCGAAAGGG TCAGAAGGTG 7740 

CAGACCTTAT CAGCATGAAA GGGGATGTCA TTACAGAACA TCAATTTTAT GAGCAAGTGA 7800 

AAAGCAACCC TTCAGCCCAA CAAGTCTTGT TAAATATGAC CATCCAAAAA GTTTTTGAAA 7860 

AACAATATGG CTCAGAGCTT GATGATAAAG AGGTTGATGA TACTATTGCC GAAGAAAAAA 7920 

AACAATATGG CGAAAACTAC CAACGTGTCT TGTCACAAGC AGGTATGACT CTTGAAACAC 7980

GTAAAGCTCA AATTCGTACA AGTAAATTAG TTGAGTTGGC AGTTAAGAAG GTAGCAGAAG 8040

CTGAATTGAC AGATGAAGCC TATAAGAAAG CCTTTGATGA GTACACTCCA GATGTAACGG 8100 

CTCAAATCAT CCGTCTTAAT AATGAAGATA AGGCCAAAGA AGTTCTCGAA AAAGCCAAGG 8160 

CAGAAGGTGC TGATTTTGCT CAATTAGCCA AAGATAATTC AACTGATGAA AAAACAAAAG 8220 

AAAATGGTGG AGAAATTACC TTTGATTCTG CTTCAACAGA AGTACCTGAG CAAGTCAAAA 8280 

AAGCCGCTTT CGCTTTAGAT GTGGATGGTG TTTCTGATGT GATTACAGCA ACTGGCACAC 8340 

AAGCCTACAG TAGCCAATAT TACATTGTAA AACTCACTAA GAAAACAGAA AAATCATCTA 8400 

ATATTGATGA CTACAAAGAA AAATTAAAAA CTGTTATCTT GACTCAAAAA CAAAATGATT 8460

CAACATTTGT TCAAAGCATT ATCGGAAAAG AATTGCAAGC AGCCAATATC AAGGTTAAGG 8520 

ACCAAGCCTT CCAAAATATC TTTACCCAAT ATATCGGTGG TGGAGATTCA AGCTCAAGCA 8580 
GTAGTACATC AAACGAATAG TCCAAATCAA TGAGTCAGGG AAAAAACTCG ACTTCAGGAA 8640

AAAATGAAGC AAACATTCCC ACAATAAAAC GCATAGTACA AGGTTTGTAC TGCCCCCCAA 8700

AAAGTTAGAC AATTAATTTA TCCGAAGGAT TTAGTTCTGT ATTGCACAGA GCTAAGTCCT 8760

TTTAGTTTTA TCTTAATTCT CTTATTGTTG TAATAATCAA TATAGTCTAT AATGGCTCGT 8820

TCCAATTGAT TAAGTGATTT AAATGTTTTC TCATAGCCAT AAAACATTTC GGATTTTAAA 8880

ATGCCAAAGA AAGATTCCAT CCTACCGTTG TCTTGGCTGT TGCCCTTACG TGACATGGAT 8940

GCTTGAATTC CCTTACTCTC TAGGAAGCGA TGATAAGAAT CGTGTTGATA TTGCCAGCCT 9000

TGGTCACTAT GGAGAATCGT ATTCTCGTAG TGCTTCTCTT TGAATGCCTG TTCCAACATT 9060

AACGATCAAT CAATTTAATC ATGTACCTAA GATTAGAATT GTTTATCCCA AATTTATTTG 9120

AAAGCTTCTC TAAGCTATAT CCTTGTTTTC TAAGTTCATA GATCTGAACT TTATCATCAT 9180

AAGTTAATTT CATAATAAAA ACACCCCAAA AGTTAGATTT TTTCTGTCTA ACTTTTGGGG 9240

TGTAGTTCAT GTACACCTGA TATGATGCGT TTTATAATTT TAAAGACTTT TTGACCAGCC 9300

TCATTTTTTT AACTTGATAC TCAGTGAAAA GCAAAGATTA AACTAGGAAG CTAGCTGTAG 9360

GCTGCTCAAA GAACAGCTTT GAGGTTGTAG ATAAAACTTG TGAGGTCACC AACATATATA 9420 

ATGTGAAGCT GACGTGGTTT GAATAGATTT TAGAAGAGTA TGAGTCTGGA AGTTTTAATG 9480 

GATAATGCAA GATTCCATAG AATGGGTAAG CTAGAGTTCT TATGTGAAGA GTTTGGGCAT 9540 

AAACTTTTAC CTTTTCCTCC CTACTCATCT TAGTATAGAA AAGTGAATCT GAAATAGTAC 9600

ATAACTGCTT CTAAAACATT CTTATAAATT GATTTAAATT CTCAAATCAT ATTATTCAGT 9660 

TCTTATTTCA TTTTGTTCTA CAATCCTGTT GAGAAGACAC GTGTTCATAT CAAAAAGGTA 9720 

TTGGCAAGTT GCAATACCTT TTTACGAGGC TCTGTTGTCT TATTTTTGTT TCAACTGACT 9780

ATATCTCCTA TGGTTCTAGT TCAGAAGGCT AGGCTATAAT TATGATTGAT AAGAAGTATC 9840

ATTCCAAGTA TTGGGAGTGA ATGTTTCAAA ATCATGGGTT TCTATAATGG TCAGGCTGGC 9900 

ATTTGCTAGA CCGCCATCTT TACGAAGAAG TGGTTCTTTA TAGCCTAGGA GAGTACGAAG 9960

ACTGGCAGTA AGATTGGCGC CGTGTCCGAC AATTAGAATA CGTTCAGCTG GACTATCTTT 10020 

TAATGATTTG ATAAATTGGA TGGTCCGTTG AGTTGTACTA TAGAGGGATT CGGCTCCGAA 10080

CATTCGAGTG TCAAATTGAG CAAGATTTGA ACGAAAAGCC TGGATTTGTT GCGGGTAAAT 10140 

AGCTTCCAAG GTTGCAATTT TCAAACCTTC TAACTTCCCA AGTTGCCATT CACGGAGATT 10200 

AGGAACGATT TCTAAAGAAC AGGGGGTATA GAGTTGACTT TGGATAATCT CAGCAGATTT 10260

GACCGCTCGA GGTAAATCAC TTGAATAAAT CTGATCAAAA GGAATTTCCT TGAGATACTG 10320 
ACCAAGTCGT TTTAGGGTTT CAATGGATTC AGGAAGAAGA GGAGAATCAC CACTAGCACC 10380

TTGAAAACGA CCTTCTTGGT TCCAGAGGGT ACGACCGTGG CGGACAAAGT AGAGTTTCAT 10440 

TACTTGATGT CCTCCAAAAT ATCTACAAAG TCTGCCTTTA CAAAGCTAGC CAAGTCTTGT 10500 

GGCGCGACGA TAATGCTGTG TCCGACTTCG CCTGCAGAGA CAATCATTTG ATCCAAATCT 10560 

AGAGCAATTT TATCGATAAA AATGGGATAA TTGTGTTTCT GACGAATTCC GACAGGATTA 10620 

TTGGCTCCAT GAATGTAACC AGTTGTTTTT TCTAAGTCCT TTTGTGGAAT CATGCTCACT 10680 

TTTTTATTGC CAGAAATTTT AGCTAGTTTC TTTTCAGACA AGTGCTGAGT GATAGGGACA 10740 

ATTCCGATAA TCGGTCCGGT CTTGTCTCCC AAAAGCGCCA AGGTTTTGAA AATCTGATCT 10800 

CGTTCATAAC CTTGAGGAAG CTCTCCTTCT AGGGCATTGA TTTGAATCCC CTGATGAGGG 10860

ATAGCTGCTT TAGATAGGAT TTGTTCCACC AATGTTTTTT TGATTTTAAC TTTTTTTGCC 10920 

ATTATTTATA TTTATCCTCC AATTGACTCA TCCAAATACC AAGCCAGATT CCCAGCGCAA 10980 

AGAAGAAGGC GATGATGACA TAACCGACAA GTGAAAGTCC TGTGTATTGG ATACTTTCAG 11040 

CGTTTCCTGC ATTTGGAATT AAGATCAAAA GGGTACTTGA TAGGACGATA CCGATGATGA 11100 

AATGATAGAC GAACTGTTTA CGGAGTTCTT CTAGTTCTCC GTCCGTCCAA GCGTAGGCCA 11160 

CTTCTTCTTT CTTGCCTTTA CCTTTGGACA TCTTGTAAAG AGGTGGGAGG GCAATATAGA 11220

CATGACCTGC CTCGACTAGC GGACGCATGT AACGGTAGAA AAATGTCAAG AGCAAGGTCT 11280

GGATATGGGC ACCGTCGGTA TCCGCATCG 11309 
(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5548 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 

CCATAGTCTA ACAAGTCTTT GTAAAGGTTT ATCCCTGATT CATGTAAAGA TTGTGTAAAG 60 

AATCAAAAAA AGCCACTTTT GAAAAATGGC TGCTCCTAAA AATAGCTTTA AAAATTATTA 120 

GTCCTGTGCG AAAGATTGGT TAGGAAGAAA AATCGTGAAG CAACTGCCTC TGCCAAGCTG 180 

ACTCGTCACC GTGACTTGGC CACCTAATAA TTGACTGAGT TCTTTGACAA TGGCAAGGCC 240 

AAGACCAGTG CCACCAGTTT GTCTGCTTCG ACCTTTATTA ACTCGGTAAA AACGTTCAAA 300 

AATACGATCC TGCTCTAATT GACTAATACC AATCCCTGTA TCTGATACAG AAATCTTAAT 360

GCCTTCGTTC ACCTTTTGGG TCTTGACCTC AATTTTTCCC CCTTGTTCAG TGTAACGGAT 420

GGCATTGGAT AAAAGATTGA GTAAGATTTG GGAAAGTAAT TGACTATCTG ATACGAGGGT 480

GACATCATCT GGCACCTGCA CCTTTAGCTG TAAATCCTTC TTCTTGAGCT GAGGTTGCAA 540 

GCTTTGAGTC AAATCCTGTA CAAATTCTGC CAAAGAAAGG GTCGTCCATT GTATAGGCAT 600

TTGTTGAGCC TTAGATAAGG TAAGAAGATG CTCAACAATA TGCTCAAGAC GCAAACTTTC 660

TTTGTAAATA ATGTCTAGAA AGTCATCCTT GAGCGCTTCT TCTTCAGCTG ACATCCCCTT 720 

AATGGTTTCA GCAAAGCCCT TAATCGAAGT AACTGGTGTC CTCAATTCAT GGGAGGCATT 780 

TGAGACAAAG GCTAAATTTA ACTTTTCATA AGTTCTAATC GTTGTTAAAT CATATAGCAA 840 

GACGAGCACA GCTTCCACAG ATTGGGTGGG GCTAAAAACG GGAACTGCTG TCACTTCTAA 900

AATCAAGTCA CCCTCATGAA ACCCACTTAC TTCTTGTTTT AACCTTGTTT TTTGATCAAA 960 

GGCTTGGTGA ACTAAATTCC GAATATCCAT CCGTTTGAGG TCATCAAGTG AACTTATGTC 1020 

GCCGTCCACA TCGGGAAAAT AATGAGGCAG AGAGCGACTG GATAATAACA TCTGACCTTG 1080 

AGCGGAAACT AAAAACGTCC CCATGGTTAG GTGCGACAGA AGAACCTCCA TTGTTTCGGC 1140 

TAGATCCTTG TATTGCTGAT CCTGTTGGGA GACTTTGGTT TTTAGGCCAG ACACATACTG 1200

AGCCAAAGAC TTTAAGTCTT CTTGCCCTTT TTCTAAAAAG TATTCACTAC TGGTCAAGAG 1260

AGGTTGGTGC AAGGTCTCAA AAGCAACTTC CCATTTCCAA AGGCAAAAGA GCCAGTAGCC 1320 

ACCTAGTCCC AAAGAAAGGG CTAGAAGAAA GAGACCGATG CCTTTACTGA TCCAAGTTAA 1380

TGCCATCCCT GCAATCAGAA TGAGGCTAAC ACTTAGATTG ACTAGCCAAA ATTGAAGGTA 1440 

GCGTTTCATC TATAACTCCT TGAACTTATA ACCATAACCC CGAATGGTTC GAATAAATTG 1500 

AGGGGCTTTA GGATTGTCTT CAATTTTTTC CCTCAACTTA CCAATATGAA CGTCCACCAA 1560

ACGTGTTTCC TGCCCAAAGT CATACCCCCA GATACGTTCC AAAAGACGCT CTCTAGTCAG 1620

TGTCATGTTG GGATGTTTCA TAAGATAGAG CAAGAGTTCA AATTCTTTTG GGGTCAAACT 1680

CAGTAACTTA TTCGCCTTGT AGACTTCATG ACGCTCAGGG TATACTTTCA AGGTCCCAAA 1740

TAGCCAAGAA TCGTCAGCGA TATTATCTGA ATCATCTCCT TCTTGTTCTC CTTTAGTTCG 1800

CCTGAGGACA GCCTTGACAC GCGCCAGCAA TTCTCTAGGG CTAAAAGGCT TGGTCAGGTA 1860

GTCATCAGCC CCTAATTCCA AGGCCAAAAC CTTATCAAAT TCATCACTTT TCGCAGAAAC 1920 

CATCATAATT GGAGTTTTGA CGCCTTTGGC TCTCAGCCGC TTACAAACTT CCATGCCATC 1980 

TAATTGTGGT AACATGATAT CAAGCAAGAT AAAATCAAAG GGTTCTGTTT CTGCCAAAGC 2040

TAAGGCCTTC CGTCCATTTG TCACCAATTG AGTAGAAAAG CCTTCCTTAC TTAAATGGTA 2100 

GTCAAGCAAT TTCAGAATGT GTTCTTCATC ATCCACTAAT AAGACTTGTT TTGTCATCTA 2160 
TTATCTCCTA TTGGTAACAT TATAACACAA TTATCAGAAA TCCTAACATT GCTAAATCAG 2220

ATTAAATTTG CCTATCAAGA CTAGTATCTG GTCAAACGCT CAATCATCTC CTTGTGCTCT 2280

GGATAGGTCG CCAGTAGATC TACCCTTTCA AATAATTCAA AATCCTCAAA TTCAAAACCA 2340 

GGAGCAACAA GACAAGAAAC CAGAGCATCA TCCTTATCAA CTGTTGATCC CCAAATAGTG 2400

CCCTTAGGAA CACAGTAGTG AAGTTGTTGC CCTTTGGATA TGTCCAGGCC TAAAGTGACT 2460 

GCTTCGTAGT GACCATCTGC TGTAATCATG TGAACAGTAA GTGGGGATCC TGCATGAAAA 2520 

TACCAGATTT CATCTGCTGT CAATCGGTGA AAATGTGAAG GATTCGTTTC TTCTAATAAG 2580

AAATAAATAC TGGTATAAAG CGCCCTTCCC TTACCAGCAA GGTTTATAGT GTCTGAAGCT 2640

TTTTTTGTTT GTCTAAAATA GCCACCTTCA ATATGGGGAG CTAACTCTAG AGTTCTTATC 2700

AAGTCTTCTT TATCCGTCGG AGCCAATGGG TTGAAGTAAC TCTTGTTCAA AGTGGTTTTA 2760

CGATTTCAAG AACTCCTCTC AGTTCTGAGG ACACGGTAAT GATTGATGCG ACGGAAGTAC 2820

AAATCAATCG CCCTAAAAAA AGAATTAGCG AATGATTCTG GTAAAAAAAA TGCCACGCTA 2880

TGAAGGCTCA AGCGATTGTC ACAAGTCAAG GGAGAATTGT TTCTTTGGAT ATCGCTGTGA 2940

ACTATTGTCA TGATATGAAG TTGTTCAAAA TGAGTCGCAG AAATATCGGA CAAGCTGGTA 3000 

AAATCTTGGC TGACAGTGGT TATCAAGGGC TCATGAAGAT ATATCCTCAA GCACAAACTC 3060

CACGTAAATC CAGCAAACTC AAGCCACTAA CAGTTGAAGA TAAAGCCTAT AACCATGCGC 3120 

TATCCAAGGA GAGAAGCAAG GTTGAGAACA TCTTTGCCAA AGTAAAAACG TTTAAAATGA 3180 

TTTCAACAAC CTATCGAAAT CATCGTAAAC ACTTCGGATT ACGAATGAAT TTGATTGCTG 3240 

GCATTATCAA TCATGAACTA GGATTCTAGT TTTGCAGGAA GTCTATTATT TGGTTAGGTG 3300

AATTAGTGAA GCGTTTAGGC AAGTGTCTCT GGTTACGACG TCATGGACTC TAAATCGATT 3360

ATATTTAGGG GTCATGACTA GTGAAGCAGT TAGCTAGTTC GCATATAAGC GGCTAGCGTC 3420 

TAACAATTAG GAACTTTAGT TCCAATAACT TTAAGATTAC GACGTTTTAG GACATAAATC 3480

GATCATATTT ATGTCCTAAA ACTAGTGAAG CGCCTAGCCA AAGTCCGAAT AGGATTTiJGC 3540

GTTAGTTACT TAGATTGCTT TGCAATCAAG TAACTTTGGC GATTTACATC TTCTCTGGCG 3600

CTTCTACTCC AAGCAAGCGA AGGGCTTCTT TGAGAACGAC TGCGGTTGCG TAGCTGAGGG 3660

CTAGACGGCT GTCGCGTTCT GGGCTTTCAT CCAAGATACG TGTATGTGCA TAGTATTTGT 3720

TAAAGGATTG AGCCAGGCTA ATTGCAAATT TAGCAATGAT AGAAGGTTCA AAGTTATCTG 3780 

CCGCACGGTT GATAATACGT GGGAAGTCTT GAATGAGTTT AATGATTTCC CAGCTTTCAG 3840 

TATCATTCAA GCTATAGTTG CCAGCTGTTT CTGGTTTGAA ATCGGCTTTG CGTAAGATAG 3900 

ATTGGATACG AGCGTAGGCA TATTGAACGT AAGGTCCAGT TTCACCCTCG AAGGATACCA 3960 
TAGCCTCTAG GTCGAAGTCG TATCCATTTG TACGGTCGGT TTTGAGGTCA TAGAATTTAA 4020

TGGCTCCAAT CCCAACAGCA TGTGCTACTT GGTCTTTGTT TTCTAGTTCA GGATTTTTAG 4080 

CCTCGATTTG GACCTTGGCA CGGCTAACAG CCTCTGCAAC AGTAGGCTCT AGCAAGATGA 4140 

CATTCCCTTT ACGAGTAGAG AGTTTCTTCC CTTCTTTTGT AACCAAACCA AAAGGAACGT 4200 

GAGTAATGTC GTCACTCCAG TCGTAGCCCA TCTCTTGCAA GACAGCTTTG AGCTGTTTAA 4260 

AGTGGGCAGA TTGTTCTTGA CCAACGACAT AGATAGATTT AGCAAATTGG TATTCGTTTT 4320

TACGGTAGAG GGCTGCAGCC AAGTCACGTG TGATATAGAG AGTTGCACCA TCAGACTTCT 4380

TGATGAGGGC TGGATGTTCA ATTCCATATT TCTCAAGATT CACAACTTGG GCACCTTCTG 4440 

ATTCAAGAAG TAGTCCTTTT TCAGAAAGAA TGTCTACAAC TGCATCCATC TTATCATTGT 4500 

AGAAGGCTTC TCCGTTATAG CTGTCAAATT CAACCTTCAA TTCATTGTAA AGGCGGTTAA 4560 

ATTCCACTAA ACTTTCATCG CGGAACCATT GCCAAAGAGC GAGAGCTTCC TCATCTCCAT 4620

TTTCAAGTTT ACGGAACCAT TCGCGCGCTT CTTCATCCAA GCTAGGGTCA TTTTCAGCTT 4680

CAGCGTTGAT GCGGACATAG AGTTTAAGGA GTTCATCGAT TGGATGAGCT TTTACAGCTT 4740

CTTCGTCGCC CCATTTTTTG TAGGCAACAA TCAACATCCC AAATTGTTTA CCCCAGTCTC 4800

CCAAATGGTT GACCTTGACC GTTTGATAAC CGATTTTTTG GAAAATATGT GACAAGCTAT 4860

CTCCGATAAC AGTTGAACGC AGGTGGCCAA TAGAAAATGG TTTAGCGATA TTCGGACTAG 4920

ACATGTCGAT AACAACATTT TCTTGTTTAC CAATATTTTG GTCAGCATAG TGTTCTTTTT 4980 

CAGTGGTAAC AGCTTGCAAT ACTTGAGCAG AAATGGCAGA TTTATCAAGG AAAAAGTTAA 5040 

CGTAAGGTCC TGTTGCGACA ACTTTTTCAA AGGCTTGGCT GTTCATTTTT TCAGCCAGTT 5100 

CAGCCGCAAT CATTTGTGGT GCTTTACGTT CGACTTTTGC AAGAGAAAAA GCAGGGAAAG 5160 

CAATGTCTCC CATTTCTGAG TTTTTAGGGG TTTCCAGTAA CTTTAAAATA GCCTCTTGGT 5220 

CCAGGCTATC AATGATGCTA GATAATTCGC TAGCAATCAA TTCTTTTGTA TTCATTAAGA 5280

GCTCCTTTTT GGACTTTTCT ACTATTTTAT CACAATTTTA AAGAAAGAAG AAAAAATTTT 5340

TGAAATCTCC TGTTTTTTTG GTATAATATG GTTATAAATA TAGTTATAAA TATGCACGCA 5400 

AGAGGATTTT ATGAGAAAAA GAGATCGTCA TCAGTTAATA AAAAAAATGA TTACTGAGGA 5460

GAAATTAAGT ACACAAAAAG AAATTCAAGA TCGGTTGGAG GCGCACAATG TTTGTGTGAC 5520

GCAGACAACC TTGTCTCGTG ATTTGCGG 5548 

(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3132 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110:

TACCCGGTAG TCTTAGCAGA CACATCTAGC TCTGAAGATG CTTTAAACAT CTCTGATAAA 60 

GAAAAAGTAG CAGAAAATAA AGAGAAACAT GAAAATATCC ATAGTGCTAT GGAAACTTCA 120

CAGGATTTTA AAGAGAAGAA AACAGCAGTC ATTAAGGAAA AAGAAGTTGT TAGTAAAAAT 180 

CCTGTGATAG ACAATAACAC TAGCAATGAA GAAGCAAAAA TCAAAGAAGA AAATTCCAAT 240

AAATCCCAAG GAGATTATAC GGACTCATTT GTGAATAAAA ACACAGAAAA TCCCAAAAAA 300

GAAGATAAAG TTGTCTATAT TGCTGAATTT AAAGATAAAG AATCTGGAGA AAAAGCAATC 360

AAGGAACTAT CCAGTCTTAA GAATACAAAA GTTTTATATA CTTATGATAG AATTTTTAAC 420

GGTAGTGCCA TAGAAACAAC TCCAGATAAC TTGGACAAAA TTAAACAAAT AGAAGGTATT 480 

TCATCGGTTG AAAGGGCACA AAAAGTCCAA CCCATGATGA ATCATGCCAG AAAGGAAATT 540

GGAGTTGAGG AAGCTATTGA TTACCTAAAG TCTATCAATG CTCCGTTTGG GAAAAATTTT 600

GATGGTAGAG GTATGGTCAT TTCAAATATC GATACTGGAA CAGATTATAG ACATAAGGCT 660

ATGAGAATCG ATGATGATGC CAAAGCCTCA ATGAGATTTA AAAAAGAAGA CTTAAAAGGC 720

ACTGATAAAA ATTATTGGTT GAGTGATAAA ATCCCTCATG CGTTCAATTA TTATAATGGT 780

GGCAAAATCA CTGTAGAAAA ATATGATGAT GGAAGGGATT ATTTTGACCC ACATGGGATG 840 

CATATTGCAG GGATTCTTGC TGGAAATGAT ACTGAACAAG ACATCAAAAA CTTTAACGGC 900

ATAGATGGAA TTGCACCTAA TGCACAAATT TTCTCTTACA AAATGTATTC TGACGCAGGA 960 

TCTGGGTTTG CGGGTGATGA AACAATGTTT CATGCTATTG AAGATTCTAT CAAACACAAC 1020 

GTTGATGTTG TTTCGGTATC ATCTGGTTTT ACAGGAACAG GTCTTGTAGG TGAGAAATAT 1080 

TGGCAAGCTA TTCGGGCATT AAGAAAAGCA GGCATTCCAA TGGTTGTCGC TACGGGTAAC 1140 

TATGCGACTT CTGCTTCAAG TTCTTCATGG GATTTAGTAG CAAATAATCA TCTGAAAATG 1200 

ACCGACACTG GAAATGTAAC ACGAACTGCA GCACATGAAG ATGCGATAGC GGTCGCTTCT 1260 

GCTAAAAATC AAACAGTTGA GTTTGATAAA GTTAACATAG GTGGAGAAAG TTTTAAATAC 1320

AGAAATATAG GGGCCTTTTT CGATAAGAGT AAAATCACAA CAAATGAAGA TGGAACAAAA 1380 

GCTCCTAGTA AATTAAAATT TGTATATATA GGCAAGGGGC AAGACCAAGA TTTGATAGGT 1440

TTGGATCTTA GGGGCAAAAT TGCAGTAATG GATAGAATTT ATACAAAGGA TTTAAAAAAT 1500 

GCTTTTAAAA AAGCTATGGA TAAGGGTGCA CGCGCCATTA TGGTTGTAAA TACTGTAAAT 1560

TACTACAATA GAGATAATTG GACAGAGCTT CCAGCTATGG GATATGAAGC GGATGAAGGT 1620

ACTAAAAGTC AAGTGTTTTC AATTTCAGGA GATGATGGTG TAAAGCTATG GAACATGATT 1680 

AATCCTGATA AAAAAACTGA AGTCAAAAGA AATAATAAAG AAGATTTTAA AGATAAATTG 1740

GAGCAATACT ATCCAATTGA TATGGAAAGT TTTAATTCCA ACAAACCGAA TGTAGGTGAC 1800 

GAAAAAGAGA TTGACTTTAA GTTTGCACCT GACACAGACA AAGAACTCTA TAAAGAAGAT 1860 

ATCATCGTTC CAGCAGGATC TACATCTTGG GGGCCAAGAA TAGATTTACT TTTAAAACCC 1920 

GATGTTTCAG CACCTGGTAA AAATATTAAA TCCACGCTTA ATGTTATTAA TGGCAAATCA 1980 

ACTTATGGCT ATATGTCAGG AACTAGTATG GCGACTCCAA TCGTGGCAGC TTCTACTGTT 2040

TTGATTAGAC CGAAATTAAA GGAAATGCTT GAAAGACCTG TATTGAAAAA TCTTAAGGGA 2100 

GATGACAAAA TAGATCTTAC AAGTCTTACA AAAATTGCCC TACAAAATAC TGCGCGACCT 2160 

ATGATGGATG CAACTTCTTG GAAAGAAAAA AGTCAATACT TTGCATCACC TAGACAACAG 2220

GGAGCAGGCC TAATTAATGT GGCCAATGCT TTGAGAAATG AAGTTGTAGC AACTTTCAAA 2280

AACACTGATT CTAAAGGTTT GGTAAACTCA TATGGTTCCA TTTCTCTTAA AGAAATAAAA 2340

GGTGATAAAA AATACTTTAC AATCAAGCTT CACAATACAT CAAACAGACC TTTGACTTTT 2400

AAAGTTTCAG CATCAGCGAT AACTACAGAT TCTCTAACTG ACAGATTAAA ACTTGATGAA 2460

ACATATAAAG ATGAAAAATC TCCAGATGGT AAGCAAATTG TTCCAGAAAT TCACCCAGAA 2520

AAAGTCAAAG GAGCAAATAT CACATTTGAG CATGATACTT TCACTATAGG CGCAAATTCT 2580 

AGCTTTGATT TGAATGCGGT TATAAATGTT GGAGAGGCCA AAAACAAAAA TAAATTTGTA 2640

GAATCATTTA TTCATTTTGA GTCAGTGGAA GCGATGGAAG CTCTAAACTC CAGCGGGAAG 2700

AAAATAAACT TCCAACCTTC TTTGTCGATG CCTCTAATGG GATTTGCTGG GAATTGGAAC 2760 

CACGAACCAA TCCTTGATAA ATGGGCTTGG GAAGAAGGGT CAAGATCAAA AACACTGGGA 2820

GGTTATGATG ATGATGGTAA ACCGAAAATT CCAGGAACCT TAAATAAGGG AATTGGTGGA 2880

GAACATGGTA TAGATAAATT TAATCCAGCA GGAGTTATAC AAAATAGAAA AGATAAAAAT 2940

ACAACATCCC TGGATCAAAA TCCAGAATTA TTTGCTTTCA ATAACGAAGG GATCAACGCT 3000 

CCATCATCAA GTGGTTCTAA GATTGCTAAC ATTTATCCTT TAGATTCAAA TGGAAATCCT 3060 

CAAGATGCTC AACTTGAAAG AGGATTAACA CCTTCTCCAC TTGTATTAAG AAGTGCAGAA 3120

GAAGGATTGA TT 3132 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14672 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

CGAGATTTCT TTAAATGAAC TACGTGAAAT CTACCCATCA TCCAGATCTG GATATTCTCT 60 

CCTATCTATA AGTAAAGTTT TAGGAGATTT TAATATAAGT TCTCATGCTT TTAAAGCTTC 120 

GGTAAGAGAT TTAAAACCGC TCAGTTTCCC ACTCATTTGC TTCTGGGAGA GTTCTCATTT 180 

TATTATTCTT GAAAAAATTA GTAAAAACAA GTTTTATATT TTAGATCCTG CAAAAGGCAG 240 

GCAGAGAATG TCAATAAGTG AATTTGAAAG GCATTATTCA AATATCATTT TAACATTTAA 300

AAAGTTAGAT AGCTTTATGT CTCGTAAAGA TAATAAGAAG TCGCCTGTTT TAAAGTATTT 360

TTTTAAGTAT AGGAATAAGC TAGGGATTTT ATTTTTTGTA ACAGCATTAT TGTATGTAAT 420 

ACAATCATTA GTACCTATAG CTAATAGATA CATAATTGAC ACGAATTTCA AGGACGATTC 480

GTATTCGTCT AGAATGTTAT TTACTATATT ATTTATATTT ACTGTTTCAT TCTCACTAAT 540

GTATTTATTA AGACAGATAT ATGTTGCATC CTTAAAATAT ATAATGGATA AAGAGATTAG 600 

CTATGATTTT ATGAAACATT TGATATATTT ACCTTACAGT TTTTATGAAA AACGTACTTT 660

AGGGGATATA CTTTTTAGAG CTAACTCTAT TGTTTATATA AGAGAAATAC TATCAAATAA 720

TTTTATAGCA GCTATACTTG ATTTGTTAAT GATTGTGGTT TATGCTGTGG TTTTATTTAG 780 

CTTTTCTAAG TACATGGTAA TCTTTTTAAT ATCACTAAGT CTAGCTCTAT CTATTGTAAT 840

GTATCCAATC ATAAAAATCT CAAAAAATTT AATTGATAAA AATATAAAAG AAAAGGTTAA 900 

TGTTCAAAAT ATTACTTCCG AAGTAATTTC TAAAAATAGT GATATTAAGC TAACTGGAGA 960

AGAGGAATTT TGGATTAACA AATGGGATAA TTTTAATACA AAACAGCTCA TCATAGGTCG 1020

AAAACTTGAT ATACATTTAT CAATTGTTAG TAGTATAACG AATGTTTTAC AAATTATTCT 1080 

CCCTGTTTTG ACCCTTATTG TAGGTGTAAA TATAAAAACA TTCGAACAAT TGACGTTAGG 1140

ACAAATTGTA GCAATAAGTA CAGTCTCACC ATACTTTATT TCTCCTATAA TTTCTTTAAG 1200

TGATAACTAT ATACAATTAA TGTTATTAAA GGGATATTTT TTAAGAATAG AGGATGTGTT 1260

TAATACTAAA TCCGAATTAA TTCCAGAAAG AGTCAGTCAA GATATAAAAT TTGATAAAAA 1320

AATAGAATTA AAAGATATTT GGTATAAATA TGGATTATTT GATGATTATG TTTTGAAAGG 1380 

AATAAATGTT ACTATTAAAA AAGGAGAAAC TGTTGCTATT GTTGGAGAAT CAGGTTCAGG 1440 

TAAGAGTACA TTAGCTAAAA TTTTATTAGG TTTATTAGAA CCTAATATTG GTTCAATAGA 1500 

AGTTGATGGA GTAGAAAAAG AAGAAATTGG TCAAACATTG TATAGAAAGA TTTTTGGAGC 1560

AGTGTTACAA AATTCAACCC TAAGTTATGG TACCTTAAGA GAGAATTTGA CATTTGGACA 1620

CTTTGTTTCA GATGAAGAAT TAATGACAAA TCTAAATTCA ATTGGTCTTA GCAATGTAGT 1680 

TAAATCTTTA CCTCTTGGAT TAGAGACAAT CATCGCTGAA GAAGGTAATA ACTTTTCTGG 1740 

AGGGCAGCAG CAAATGATAC TTTTAGCTCG TTGTCTTTTG TCGAAACCTT CGGTAGTTGT 1800 

TTTGGACGAA GCAACAAGTA GTTTAGATAA TTTATCTCAA CAAATTACAA CTTCTTACTT 1860 

AAGTGAAATC GGTACCACTA AGATTTTAAT TGCCCATCGA CTAGATACTA TCAAGTCTGC 1920

AGATAAGATC TTAGTAATGC ATAATGGTGA AATTGTAGAG ATTGGGACCC ATAGAGAACT 1980

TCTTGAACTA GGAGGCATTT ATAAGCAATT GTATTCAAAT AATTAGTTTT TGATTAAAAG 2040 

GGTAAATTTA TGAAGATTAT GAAAAAAAAA TATTGGACTT TAGCGATATT ATTCTTTTGT 2100 

TTGTTCAATA ATTCTGTTAC TGCTCAAGAA ATACCTAAAA ATCTTGATGG CAATATAACT 2160 

CACACTCAGA CTAGCGAAAG TTTTTCTGAA TCTGATGAAA AACAGGTTGA CTATTCTAAT 2220

AAAAATCAAG AAGAAGTAGA CCAAAATAAA TTTCGTATTC AAATCGATAA GACAGAATTA 2280

TTTGTAACAA CAGATAAACA TTTAGAAAAA AACTGTTGTA AATTGGAACT TGAACCACAA 2340

ATAAATAACG ATATTGTTAA CTCTGAAAGT AATAATTTAC TAGGCGAAGA TAATTTAGAT 2400 

AATAAAATTA AGGAAAATGT TTCTCATCTA GATAATAGAG GAGGAAATAT AGAGCATGAC 2460

AAAGATAACT TAGAATCGTC GATTGTAAGA AAATATGAAT GGGATATAGA TAAAGTTACT 2520

GGTGGAGGCG AAAGTTATAA ATTATATTCT AAAAGTAATT CTAAAGTTTC AATTGCTATT 2580 

TTAGATTCAG GAGTCGATTT ACAAAATACT GGATTACTGA AAAATCTTTC AAATCACTCA 2640 

AAAAACTATG TCCCCAATAA AGGATATTTA GGAAAAGAGG AGGGAGAGGA AGGAATAATA 2700 

TCAGATATTC AAGATAGATT AGGTCATGGT ACGGCTGTTG TAGCTCAAAT TGTAGGGGAT 2760 

GACAATATTA ATGGAGTAAA TCCTCACGTT AATATTAACG TCTATAGAAT ATTTGGTAAG 2820

TCGTCAGCTA GTCCAGATTG GATTGTAAAA GCAATTTTTG ATGCTGTAGA TGATGGCAAT 2880

GATATTATCA ATCTTAGTAC TGGACAATAT TTAATGATTG ATGGAGAATA TGAGGACGGA 2940 

ACAAATGATT TTGAAACATT TTTGAAGTAT AAAAAGGCTA TTGATTACGC GAATCAAAAA 3000 

GGAGTAATTA TAGTAGCTGC ATTAGGGAAT GACTCCCTAA ATGTATCAAA TCAGTCAGAT 3060 

TTATTGAAAC TTATTAGTTC ACGCAAAAAA GTAAGAAAAC CAGGATTAGT AGTTGATGTT 3120

CCAAGTTATT TCTCATCTAC AATTTCGGTC GGAGGCATAG ATCGCTTAGG TAATTTATCA 3180 

GATTTTAGCA ATAAAGGGGA TTCTGATGCA ATATATGCGC CTGCAGGCTC AACATTATCT 3240

CTTTCAGAAT TAGGACTTAA TAACTTTATT AATGCAGAAA AATATAAAGA AGATTGGATT 3300

TTTTCGGCAA CACTAGGAGG ATATACGTAT CTTTATGGAA ACTCATTTGC TGCTCCTAAA 3360

GTTTCTGGTG CGATTGCAAT GATTATTGAT AAATACAAAT TAAAAGATCA GCCCTATAAT 3420

TATATGTTTG TAAAAAAATT CTGGAAGAAA CATTACCAGT AAAAAATGGT ATAAAAGTGT 3480 

TAAATATACC AAACGTATTG AGATATGATT TGAATATGTT ACAATTAGAA TATAAAAATG 3540 

AACAAAGTTG GGATAGTTTC ATAGATAATG TTAATTTAAT TGAGTTGGAA GAGAGAATTC 3600

AAACTACTAT TGGAATTAAA CAAATAAACA CACACAATAT TATTACTATT GCCCGAGAAG 3660

GGTACTCTCA AAATTATTTA CCTAACACTT CAGAAAATAC ATATAATTCA TTACAAGTCA 3720 

GTTTAGTTGG AGTATTACTA CTTTTTATAA GTATGGTAAA TATTTTATGG GCTAAAAAAA 3780

GTAAATGAAA ATAAAATTTG GAGCCCTCTG AAAAAGTAAG TCCTACAGTT CAACTAAAAT 3840 

GAGTCAAAAG ATGAATCACC TTGATGTAGG GGAGTTTGTC TTATTGCTGC CTGAACACCT 3900 

CCGTTCAGAG GAAGAACATT ATAAATCTGT TTTTGAAGAC GACTTAACCA GTCGCATATC 3960

TAGTCAAGAT GAACGACAGC AAATGACTGC TACGGTAGGT TATTTAGAAT CAGGTCAGGA 4020

TCGTTTTGTG TATAATACGA CCCCTATTTC TTACCAGCAG TTTTTGAAAG ATCCAATCAT 4080

CATTGTTATA ACACCCCAAT CAACTGGTCC ACAGTCCATT TTGTTTTGGA TAGACGCAGT 4140

ACAGAACTAC GTTCTCTTTA ATCAATTGTC TGATGCCCAG GAGCTTATCC AGAGACAAGG 4200

CATTGAAAAT TGGGTCTCAG AAATGCAAAC AGGTTACCAC AACTACATCA CATTATTGGA 4260 

TAATATCCAG AGGGAACGTT GGGTAATGCT AGCAGGAGCT GTGCTTGGGA TTGCAACTTC 4320

AATCTTGTTG TTTAACACTA TGAATAGGCT CTACTTTGAA GAATTTAGAC GTGCCATTTT 4380

TATCAAACGC ATTGCAGGTC TCAGGTTCTT AGAAATCCAT CGCACTTATC TCTTTGCTCA 4440

ACTGGGTGTG TTTTTACTGG GATTTGTTGC GAGTGTATTT CTTCAGGTAG AGATAGGAGT 4500 

TGCTTTCTTA GTCTTGTTAC TCTTTACTGG TCTATCTCTT TTACAGTTAC ATGTCCAAAT 4560

GCAGAAAGAA AACAAGATGT CCATGCTTGT TTTGAAGGGA GGTTAATATG ATTGAACTTA 4620 

AACAGGTGAG TAAATCTTTT GGAGAACGAG AGTTATTTTC GAATCTTTCA ATGACATTTG 4680

AGGCTGGAAA AGTCTATGCC TTAATTGGTT CAAGTGGTAG CGGAAAAACA ACCTTGATGA 4740

ACATGATTGG GAAATTAGAA CCTTATGATG GGACGATTTT TTACCGAGGT AAAGACTTGG 4800 

CCAATTATAA ATCAAGTGAT TTTTTCCGTC ACGAATTGGG CTACCTCTTC CAGAACTTTG 4860

GCTTAATTGA AAACCAAAGT ATTGAAGAAA ACCTTAAGCT AGGTCTCATT GGTCAAAAGT 4920

TGAGTCGGTC GGAACAGCGG TTGAGGCAGA AGCAGGCTTT AGAACAGGTC GGCCTGGTTT 4980 

ATCTTGACCT AGATAAGCGC ATCTTTGAGT TATCGGGCGG AGAATCGCAA CGGGTTGCCT 5040

TGGCAAAAAT TATCTTAAAG AATCCACCCT TTATTCTGGC AGATGAGCCA ACAGCTTCAA 5100

TAGACCCAGC AACCTCTCAG TTGATTATGG AGATTTTGCT ATCTCTTCGA GATGATAATA 5160

GGCTAATCAT TATCGCAACA CATAATCCGG CAATTTGGGA GATGGCTGAT GAAGTGTTCA 5220

CGATGGATCA TCTGAAATAA AAATCCTTGT TTTTAATTGC ACGATGAGTT ACTGAAATAT 5280

TATCATGAAT CAAGAATTGG AGTTAATTTA GAATTGTACT TAATTTAGAA TTGTACTTTA 5340 

TTAATATTGA GGTAACTTTT TCTTGATAAA GGAAGAAATA ATGGAGAGGA AGTTAGAATG 5400

AAAAAATTCG ACAATTATAT TATTGAGAAG CCTTGCGATT CTAATTCAGA TAAACTGCAA 5460

AAAATCTTAA TAATTGAAAG TTTGGTAGAT GATATTTTGC AATTTTCTCT CAGAATCAAT 5520

AATAGTGTAG GAGAGATTTT CCTCCTACAA CCGTTTTAAA AGAAAACTAT CTTTATTCCA 5580

TGTTATTTTG AGGAAGATAT TGTGAAAGTC AAAGATGATG ATAAAGTTGA GTGGAATTTG 5640

TTAGAATTTC AAAAATTTAG AGCATTTTTG GCTTAGTAAT CTGTGTTGAA GGCTCAAAAC 5700 

CTATGGTAAA AAAGTAGCTT TGAAAACGTA TTGCCTCCAA AGATTTAGTT AAATAATGAT 5760

TTAACACAAA AAGAAATTAT TGAAGTTCTG GAAAGATGTT GTTTCAGTAT TGAGAAAAGG 5820

TGGGAAAAAC TTGCGATTTT CACAGAGAAA GGAAGAAAAA GTATAGAAAT ATAGTCAATT 5880

GAAACAAGAA CAGGATAAAA GAACCTTTTG TGCCATATTT TTCTCCTTTC GCTTTACAAT 5940

TGGATTGAAC ACCTTTATTG TATCGCGTTT GGAGTTTTTT TGGTATAACC TTCGACGCAC 6000 

ACCCGCATAG CGGGTGTTTT TTTTGTCTCG CACCTAACGG AGCGAGACAA ACTAATAGTC 6060

ACTTAATCAA AAAACGCACC ATATCAAAAA CTAAAAAGTT TGATATCATG CGTCATGTCT 6120 

TAAACTAATT GACTATACTT TCTATTCAAA TGAGCTTTTA ACCAATTGAT TGAGCCAATC 6180 

CACTCTTAAA ACCAAAGAGC AATTTCTCGC TTAGCTGACT CTTCTGAATC TGAACCATGT 6240

ACAACATTTT GGATAATCTC ATTTTCTCCA GCAGCTTTTG CAAAATCACC TCGAATAGTG 6300 

CCTGGTAAAG CTTCTTCTGG ACGAGTTGCA CCCATCATGG TCCGCCAAGT TTCGATTACT 6360

TTGGGACCAG AAATGACACC CACAAGAACT GGACCTGAAG TCATGAATTC ACGAATCGGT 6420

GGGTAAAAAC TCTGACCAAC CAAGTCCTGA TAGTGCTGGT CAATCAACTC TTCTGAAACC 6480 

TGTGAACGAA ACTCCAATTT TTCGATTGTA AATCCACGTT GTTCGATGCG CTTTAACACT 6540 

TCACCCACTA GCCCTCTTTT TACACCATCT GGTTTGATGA TAAAGAATGT TTGTTCCATA 6600 

CCCGTCTCCT TTGTCAGCTT CTTTCTTTTA TTTTACCACA TTTCGTGGAA AAATGGAGAA 6660 

AGTTTTCAGA AGAGAGAATG AGAGAACCCT CGGGTTCTCT CATTCTCTCT TATTCTACTG 6720

TTTCTTCCAC AGTTTCAACG GCAGTATCCA CAACTACTTC TGTTGTTTCT TCATTTCCTT 6780

CTTCCTCTAC TGGAGGATTA AGGTATTCTT CTTCGTTGAC AGCATGTGGT TCAAGGTTAC 6840

GGTAACGGGC CATACCAGTA CCAGCTGGGA TGATCTTACC GATGATAACA TTTTCTTTAA 6900

GTCCAAGGAG ATGGTCTTTC TTACCACGGA TAGCTGCGTC AGTAAGGACA CGAGTTGTTT 6960 

CCTGGAAGGA AGCCGCTGAC AAGAAACTGT TTGTTTCAAG TGAGGCTTTG GTAATTCCCA 7020 

TAAGGACTGG GCGACCTGTC GCTGGAACTC CACCTGCGAT AAGGACATCT TTGTTGGCAT 7080 

CTGTAAAGTC ATTGATATCC ATGAGGGTAC CCATGAGAAG ATCTGTATCA CCTGGATCCA 7140 

TGACACGGAC TTTACGGATC ATTTGACGAA CCATTACCTC GATGTGTTTG TCACCGATTT 7200

CTACCCCTTG GCTACGGTAA ACTTTTTGTA CTTCACCGAG AAGGTACGTT TCAACTGACA 7260

AGACATCACG AACTGCAAGG AGACGTTTTG GTTGGATAGA ACCTTCTGTC AGAGCAGCAC 7320 

CACGCGCTAC TTGGCCCCCA ACTTCGACAC GCATACGAGC TGTAAATGGA ACGACATATT 7380

CACCTTCGCC AGTTTCACCC TTAACAAAGA CTTTCTTGGT ACGAGTTGAT GCATCTTCTT 7440 

CGATAGCAGT AACTTGTCCT TTAACCTCTG TAATAACCGC TTCCCCTTTA GGATTGCGGG 7500 

CTTCAAAGAT TTCTTGGACA CGAGGAAGAC CCTGAGTGAT ATCGGTATTT GAGGCAACCC 7560

CACCTGTGTG GAAGGTACGC ATTGTAAGCT GTGTACCAGG TTCCCCGATA GATTGGGCAG 7620

CGATTGTACC AACTGCTTCA CCAACTTCAA CCGCATCACC AGTCGCCAAG TTGATACCGT 7680

AACAGTGACG GCAGACACCG TGACGAGTGT TACATGTAAA TACAGAACGG ATAGTCACTT 7740 

CTTCCACACC AGCATTGACA ATTTCACGCG CCTTGTCTTC TGTAATCAAT TCATTTGGAC 7800 

CAATAATCAC TGCACCAGTT TCTGGATGTT TAACAGTTTT CTTAGTGTAA CGACCGTTGA 7860

GACGCTCTTC GAGAGACTCG ATCATCTCTT TTCCTTCTGC GATAGAACGG ATCAAGAGAC 7920

CACGGTCAGT TCCACAGTCG TCCTCACGGA TGATAACGTC TTGGGCAACG TCGACCAAAC 7980 

GACGAGTCAA GTAACCTGAG TCGGCTGTCT TAAGGGCCGT ATCGGTCATA CCTTTACGAG 8040

CACCGTGAGT TGAGAAGAAC ATTTCCAATA CCGACAAACC TTCGCGGAAG TTTGAAAGGA 8100 

TTGGCAATTC CATGATACGT CCATTCGGAG CAGCCATCAG ACCACGCATA CCGGCAAGCT 8160 

GTGAGAAGTT TGAGATGTTA CCACGGGCTC CAGAGTCCAT CATCATAACG ATTGGGTTCT 8220 

TAGGATCTTG GTTAGCAATC AAGCGTTTCT CAAGTTTTTC ACGGGCAGCA CGCCATTCAG 8280 

CTGTAACAGC ATTGTAACGC TCGTCGTCTG TGATCATACC ACGACGGAAT TGTTTGGTGA 8340

TTTGTTCGAC ACGTTTGTGT GATTCTTCAA TGATTTCAGC CTTGTCATCA ACGACTGGGA 8400 

TATCGGCAAT ACCCACTGTC AATCCTGCAA GAGTTGAGTG GTGGTAACCG AGGTTCTTCA 8460 

TGCGGTCAAG TAGGGCAGAA GTTTCTGTCG TACGGAAACG TTTGAAGATT TCAGCGATGA 8520 

TATTTCCAAG GTTTTTCTTC TTGAATGGAG GGTTGAGCTC AAGATTGCTG ATAGCTTCCT 8580 

TGATATCTCC ACCAAGTGGC AAGAAGTATT TAGCTGGAAC ACCTTCTGTC AAGTTGGCAT 8640

TGTTTGGTTC TTGCAAGTAT GGTAGCCCCT CTGGCATGAT ATCGTTGAAG AGAATTTTAC 8700

CAACTGTTGT AAGCAAGACC TTATGTCTTT GCTCTTCTGT CCAAGGCTTG TTGAGGCTGT 8760

CTGTTGCGAT ACCAACACGT GAGTGGAGGT GAACATAACC ATTGCGGTAA GCCATAACCG 8820

CTTCGTCACG GTCTTTGAAG ACCATTCCTT CACCTTCGCG ACCAGCTTCT TCCATGGTCA 8880

AGTAGTAGTT ACCCAAAACC ATGTCCTGAG ATGGAGTAAC TACCGGTTTC CCATCTTTCG 8940

GGTTCAAGAT GTGCTCAGCA GCTAGCATGA GGATACGAGC TTCTGCTTGT GCTTCTTCTG 9000

AAAGTGGTAC GTGGATGGCC ATTTGGTCCC CGTCAAAGTC AGCATTGTAG GCTTCACAGA 9060

CAAGTGGGTG CAAGCGAAGA GCCTTACCAT CAATCAAGAC TGGCTCGAAG GCTTGGATAC 9120

CCAAACGGTG AAGGGTCGGT GCGCGGTTCA AAAGCACTGG GTGTTCTTTA ATCACTTCTT 9180

CAAGGATATC CCAGATACGC TCATCTCCGC GTTCCACCAA GCGTTTAGCT GCTTTGACGT 9240

TTTGCACGAT ATCACGGGCA ACGATTTCAC GCATGACAAA TGGTTTAAAG AGTTCAATCG 9300

CCATTTCACG CGGCACACCA CATTGGTACA TCTTAAGAGT TGGACCAACG GCGATAACTG 9360

AACGTCCTGA GAAGTCAACA CGTTTACCGA GCAAGTTTTG ACGGAAGCGT CCTTGTTTAC 9420

CTTTAAGCAT GTGGCTCAAT GATTTCAATG GACGGCTACC TGGTCCTGTG ATTGGACGAC 9480

CACGACGACC ATTGTCAATC AAAGCGTCAA CTCCTTCTTG AAGCATACGC TTCTCATTTT 9540

GAACGATGAT ACCTGGTGCA TTTAACTCAA GCAAACGAGC CAAACGGTTG TTACGGTTGA 9600

TAACACGGCG GTAAAGGTCA TTCAAGTCAG ATGAGGCAAA ACGGCCACCA TCCAACTGCA 9660

ACATTGGACG AAGATCTGGT GGGATAACCG GAAGGATGTT AAGAATCATC CATTCAGGTT 9720

TGTTTCCAGA CTTGTAAAAG GCATCCAAAA CATCCAAACG ACGGATGGCT TTGACACGCT 9780

TTTGTCCAGT AGCTGTTTTC AATTCTTCTT TGAGTTCAGC AATTTCTTTT TCAAGATCTA 9840

CTTGCTTCAA AAGGTCTTGG ATGGCTTCCG CACCCATCTT GGCAACAAAT GAACCATAAC 9900

CATATTCACG CAAGCGCTCT CGGTATTCGC GCTCTGTCAT GATAGACTTG TGCTCAAGTG 9960

GTGTATCCTT AGGATCAATC ACCACATAAG CCGCAAAGTA GATAACTTCC TCGAGGGCAC 10020 

GAGGGCTCAT ATCAAGGGTC AAGCCCATAC GGCTTGGAAT CCCCTTGAAG TACCAGATGT 10080 

GAGATACAGG AGCTTTCAAT TCGATATGTC CCATACGCTC ACGACGAACT TTCGTACGCG 10140

TTACTTCAAC CCCACAGCGG TCACAAACAA TTCCTCTGTA ACGAATGCGT TTGTACTTAC 10200

CACAAGCACA TTCCCAGTCT TTTGTAGGAC CAAAGATCAC TTCATCAAAG AGTCCTTCAC 10260

GTTCTGGTTT CAAGGTACGA TAATTGATTG TTTCAGGTTT TTTGACTTCT CCATAAGACC 10320 

ATGAACGGAC TTTACTTGGA GAAGCTAGGG TGATTTGCAT ACTTTTAAAA CGATTTACAT 10380

CAACCACTAT TTCTTCCCTT TCTATTCTAA GTGAACTGCT TATTCTTGTT CAGCAGCTTC 10440

TTCTGTTGCT TCCGCTTTTG TTGCTTTCTC AGCTTCTTCA GCTTCAAAGG CTGCTTTAGC 10500 

CTCTTGGGCT GCTTTTTCGC GGGCTTTTTC AAGGTCATCT ACGTGGATGA CATCTTCGTC 10560 

CATTCCTTCA TCCAAGTCGC GAAGTTCCAC TTCTTGGTCA TCTTCGTCTA GGACACGCAT 10620

GTCAAGACCA AGAGATTGCA ATTCTTTGAC AAGAACTCGG AAGGATTCTG GAACACCTGG 10680 

TTTTGGAATT GGTTTGCCTT TTGTAATAGC TTCATAGGCT TTCAAACGTC CGTTGATATC 10740 

GTCCGACTTG TAAGTCAAGA TTTCTTGAAG GACATTTGAC GCACCGTAGG CTTCAAGAGC 10800 

CCAAACCTCC ATCTCACCGA AACGTTGTCC ACCAAACTGA GCCTTACCTC CGAGTGGTTG 10860 

TTGGGTAACA GTTGAGTATG GTCCGACTGA ACGCGCGTGC AATTTATCAT CAACCATGTG 10920

GTGGAGTTTG ATCATGTACA TGACTCCGAC AGAAACACGG TTATCAAACG GTTCACCAGT 10980 

ACGTCCATCG TAAAGGATCG TTTTGGCATC GCTATCCATA CCTGCTTCTT TAACAGTTGA 11040 

CCAAAGATCT TCAGAACTTG CTCCATCAAA GACTGGTGTA GCGATGTGAA TACCAAGAGT 11100 

ACGAGCTGCC ATACCAAGGT GAAGCTCCAT AACCTGACCG ATATTCATAC GTGATGGTAC 11160

CCCAAGTGGG TTCAACATGA TGTCGACTGG AGTTCCGTCT GGAAGGTAAG GCATGTCTTC 11220

TACAGGAACG ATACGAGAGA CAACCCCTTT GTTTCCGTGA CGTCCGGCCA TTTTATCTCC 11280

GACCTTAATC TTACGTTTTT GAGCGATGTA AACACGAACC AACATGTTAA CACCTGATTG 11340

CAACTCATCT CCATTTACAC GTGTAAAGAT CTTAACATCA CGAACGACAC CATCGGCACC 11400 

GTGTGGTACA CGAAGAGAAG TATCACGCAC TTCACGAGAC TTGTCTCCAA AGATAGCGTG 11460

CAAGAGACGT TCTTCAGCTG AAAGATCTTT CTCACCCTTA GGTGTTACTT TACCTACAAG 11520 

AATATCACCT TCTTTAACCT CAGCACCAAT ACGGATAATC CCCATTTCGT CAAGGTCTTT 11580 

GAGGGCATCT TCACCAACGT TTGGAATTTC GCGAGTGATT TCTTCAGGCC CAAGCTTTGT 11640 

ATCGCGCGTT TCTGATTCGT ATTCTTCAAG GTGAACAGAT GTGTAGACAT CGTCCTTCAC 11700

CAAGCGTTCG CTCATGATAA CGGCATCCTC GAAGTTGTAA CCTTCCCAAG TCATGTAGGC 11760

AACGATTGGG TTTTGTCCAA GCGCCATTTC TCCATTTTCC ATAGAAGGTC CGTCAGCGAT 11820

GAAATCGCCT TTTTCAACGA CATCACCAAC TTTTACGAGA GTGCGTTGGT TGTAAGCAGT 11880 

ACCTGAGTTT GAACGACGGA ATTTTTGGAT GTGGTAAACA TCCAATGAAC CATCTTCACG 11940 

ACGAACTTCT ACCTTGTCAG CATCTGCGTA AGTAACTTTA CCATCATACT GAGCAATCAC 12000 

AGCCGCACCA GAATCGTGGG CTGCTTGGTA TTCCATACCA GTACCAACGT AAGGTGCCTG 12060 

AGGATTAATC AATGGCACAG CCTGACGTTG CATATTGGCT CCCATGAGGG CACGGTTGGA 12120 

GTCATCGTTT TCCAAGAAAG GAATACATGC TGTCGCAACG GCAACTACCT GTTTTGGTGA 12180

AACGTCCATG TAGTCAACAA TATTAGCTGG ATACTCTTGG TTGACCCCTT GGTGACGTCC 12240

CATGACAATC TTCTCAGCAA AGGTTCCATC TTCATTCAGA CGAGAGTTAG CCTGAGCTAC 12300

AGTATATTCA TCTTCTTCAT CAGCTGTCAA CCAAACAATT TCGTTCGTGA CAACACCTGT 12360

TTCACGGTCA ACCTTACGGT ATGGTGTTTG AACAAAACCA TATTTGTTCA AGTGTCCATA 12420 

AGATGACAAG TTATTGATCA AACCGATGTT AGGTCCTTCA GGTGTCTCGA TTGGACACAT 12480 

ACGACCATAG TGAGTGTAGT GCACGTCACG TACTTCATAT CCAGCACGGT CACGAGTCAA 12540

ACCACCAGGT CCTAAGGCTG ACAAACGGCG TTTGTGAGAC AACTCAGAAA GCGGGTTGTG 12600

TTGGTCCATG AACTGTGACA ACTGTGATGA ACCAAAGAAT TCTTTAACTG CAGCTGTTAC 12660

AGGACGGATA TTGATAATTT GTTGTGGTGT CAAGACTTCA TTGTCCTGAA CAGACATACG 12720

TTCACGGACA TTACGTTCCA TACGAGAAAG TCCCAAACGT ACTTGGTTGG CAAGCAATTC 12780

ACCAACCGCA CGGATACGAC GATTTCCAAG GTGGTCGATA TCATCTACAC GGCCAAGTCC 12840

TTCAGCCAAG TTGAGGAAGT AGCTCATCTC AGCAAGGATA TCTGCAGGAG TCACCGTACG 12900

AACCTTGTCA TCTGGGTTAG CATTACCAAT GATCGTTACG ACGCGATCTG GATCAGTTGG 12960

AGCAATAACC TTGAATTTTT GAAGAACAAC AGGCTCAGTC ACAACGGCTG CATCGTTTGG 13020

GATGTAGACA ATCTTGTTCA AGTCGCCATC CAAATGGCTT TCAATGCTTT CAATCACGCT 13080

ACGAGTCATA ATCGTACCAG CTTCTACCAA GATTTCTCCA GTTTCAGGGT CTACCAATGG 13140 

CTCTGCAATG GTTTGGTTGA GCAAACGTGT TTTAACATTG AGTTTTTTAT TGATTTTGTA 13200 

ACGACCAACT GCTGCCAAGT CATAACGACG TGGGTCAAAG AAGCGAGCTA CAAGCAAGCT 13260

ACGTGAGCTT TCAGCCGTCT TAGGCTCACC TGGACGAAGG CGTTCGTAAA TTTCTTTCAA 13320

GGCTTCGTCT GTACGAGAGT CCATTGGATT CTTGTGGATA TCTTTTTCAA CAGTGTTGCG 13380

AACCAATTCG CTGTCACCAA AGATATCAAA GATTTCATCA TCACCTGAGA AACCAAGAGC 13440 

ACGAACCAAG GTTGTAAATG GAATCTTACG AGTACGGTCG ATACGAGTGT AGGTGATATC 13500

TTTTGAGTCG CTTTCAAGTT CCAACCAAGC TCCACGGTTA GGGATAACAG TTGAACCATA 13560

GCCCACCTTA CCATTTTTGT CTACTTTGTC GTTAAAGTAA ACACCTGGTG AGCGGACCAA 13620

CTGAGAAACG ATAATACGTT CACCACCATT GATGATGAAA GTACCCATTT CTGTCATGAT 13680

TGGGAAATCA CCAAAGAAAA CTTCTTGGGT CTTGATTTCG CTTGTTTCTT TATTGATCAA 13740 

ACGGAAGGTT ACAAAAATTG GTGCTGAGTA GCTAGCATCG TGGATACGAG CTTCTTCTAG 13800

CGTATATTTT GGTTCCTTGA TTTCATATCC AACAAATTCC AACTCCATTG TGTCTGTGAA 13860

GTTTGAAATT GGCAATACAT CTTCAAACAC TTCCTTAAGA CCGTGGTCTA GGAAAGCTTT 13920

GAATGAGTCA GTTTGAATTT CAATCAAATT TGGTAAGTCA AGAACTTCTT TGATTCTTGA 13980

AAAACTACGA CGGGTACGAT GTTTCCCGTA TTGAACGTCA TGTCCTGCCA AGATGATTCT 14040

CCTTTGTAAA TAAGTTCCAA GCCTTGTCAA TCAGGCTTTT CTAATCGTCA TATGGTTGTA 14100 

AACCCCTTAT CACCGTGTCC TCTTGACGAA TTTTCAGAAT CTTTAAGCCT CTGTTACAAA 14160 

TGCTCAAAAT CTTGAAAAAA AGCACAAAAA GAGCAGCTAA ATCTGACTTT TTCAGAAGAT 14220

TTAACTGCTG TGAGCCTTGT CTGGACAATA TTTCAGACAA AACCTACGAC AAATGATTAC 14280 

CCATATTATA CCCTATTTAG CTAGATTTTT CAAGGGGTTT CAGTAGGTTT TTGGTAAATT 14340 

TTTTCCCATA GAAAACTTGG CATCACATTC GAATCACGCT ATGGTACAAA AAACTGAAAA 14400 

AACTATTGAC TGAAAATCAT TTTCAAGGTA TAATAATAAA CGTTAAGGCG GTATAGCCAA 14460 

GTGGTAAGGC ACGGCTCTGC AAAAGCTTGA TCGTCGGTTC AAATCCGTCT ACCGCCTTCT 14520 

ATAACTTGAT TTATCAGGTT TCAAATGAAC AGAAAGCCCA ATTTGAAGGG CTTTTTTTAT 14580 

TTTCCCTCGA ATAAATACGT ATAACTTTAA AAACTTTTGG AGCGAGTTTG TGGCAGAGTT 14640 

CTTTCCATGG CATAATTCCC TTTTGAAATC AG 14672 
(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7902 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

AGGAGACTAT TCAAGCCCAA ATTgAGTAGC CCAGCAAAGA CTGTATAGAC TGTGATACCT 60 

TTTTCATAGC CATTGGTAAA GAGAATTTGG GAACCAAGAA TGGTATCTAA GGCCAGGATA 120 

ATCGTACGAA AAGCGAAGAG AGAGGTCAAG ATGCCGCCTC CGATATATTT TTCACTACCG 180 

TAAAGTAGGA TGGCATTTGG TCCTAAAACC ATGAGTCCAA AACTCAGTGG AATGATAAAG 240 

AAGTTAAAGA TTCGACTACC TCTATTAACC AGAGAAACAT AGGCTTCTTT GTCTCCTTTC 300

CCCAGATAGT AACTGAGACG AGGCACACTC ACTCCAATTG CACCTGTTAC AACCCCAGCT 360 

ATAACGGTCA CAATTCGCTG AGCTATGGTA TAGTAACTAA CGTTGACATC AATCCCTGTT 420 

TTAACGAGGA AGAGGCGATC TAAAAAAGTG AAGAGCATAT TGGCATTGGC AAAGACTAAC 480

ATGGCTGTCA GAGGGAGAAA GAGTGGTTTA AAATCACTTA GGTGAATTTT AACAAGTTTG 540 

ATGTCTCTTT TAATCCAAAA ATAACTAATC AGGTAGTTAA TCAGCGTCGA TAAACTCATC 600 

ACAAGTGTAT AGACAACAAT ATCGTGTTCA TTTTTAACAA ATAAGAAAAT AGAGACCAGC 660

ATCAGGATAC GGATGAAGGC AGTTTTGTAA AAGAGAAAAC TGTAATTTTC CAGAGCTTCA 720

TTGACCCATT CGATTGAAAA AATCTGGGCA ATGAGTTGAA TCCCCATAAC AAGGTAGACC 780

TTTTTGACGA TTGGATTATC AGTAAAGAAG AGAGGATAGG CTAGGATATA GACAGCAGTG 840

GTCAAAATCG TACAAGCGAT GCACAAATAA AAAAGACTAG AAAAGGTTCT GTTAAGATCT 900

TTTTTGTTAT CCTTGACATT ACTGATAGCC CTTAAACCGT AGTTATAGAC ACCATAAGTT 960

GCAAAGGGCA AGAAAAATGA CAAAATAGTG TCGACTGAGT TGAAGTAACC ATAGTCAGTT 1020

CGGTCCAAGA CACGCGCGAC ATAGGTTCCA GTTAGGATGG GAAAAATAAT ATTCAAGACA 1080

CGAATTCCCA TGTAAGATAG AGCATTTAAT TTTATACTTT TCATTCAATT TACCTCGTTT 1140

TTCATTATAT CATAAAGTTA GCTAATAAGA AATGAAGGGC AGTAAGTCAA GTAATCACTT 1200

TGAAGTTTCA AATCTTAAGT TTTAAGTTTT CTTTAAGGAA AGTATATTAT TCTGAAGGAC 1260

TCTAAAATTT CGCAGCCATT TATTAGTAAT TGCTACAGAA TTCCTAGTCA TTACTAGAAA 1320

TGGACTAGTT TCTTTGAATA ATAGAACTGC ATAATTCTCC TATTCTAGAA GGGGAGGACC 1380

AGTATTTCTT TTATGATAGG ACTAGATTGT GGTATAATAG AGAGAATAAG TTTTTTTAGT 1440

AAGACAAAGG AGAAAATAGA TGATTTATGC AGGAATTCTT GCCGGTGGAA CTGGCACACG 1500

CATGGGGATC AGTAACTTGC CAAAACAATT TTTAGAGCTA GGTGATCGAC CTATTTTGAT 1560

TCATACAATT GAAAAATTTG TCTTGGAGCC AAGTATTGAA AAAATTGTAG TTGGTGTTCA 1620

TGGAGACTGG GTTTCTCATG CAGAAGATCT TGTAGATAAA TATCTTCCTC TTTATAAGGA 1680

ACGTATCATC ATTACAAAGG GTGGTGCTGA CCGCAATACA AGTATTAAGA ACATCATTGA 1740

AGCCATTGAT GCTTATCGTC CGCTTACTCC AGAGGATATC GTTGTTACCC ACGATTCTGT 1800

TCGTCCATTT ATTACACTTC GCATGATTCA GGACAATATC CAACTTGCCC AAAATCATGA 1860

CGCAGTGGAC ACAGTGGTAG AAGCGGTTGA TACTATCGTT GAAAGTACCA ATGGTCAATT 1920

TATTACAGAT ATTCCAAATC GTGCTCACCT TTATCAAGGA CAAACACCTC AAACATTCCG 1980

TTGCAAGGAC TTCATGGACC TTTATGGATC TCTTTCTGAT GAAGAGAAGG AAATCTTGAC 2040

AGATGCATGT AAAATCTTTG TGATCAAAGG AAAAGATGTG GCTTTGGCCA AAGGTGAATA 2100

CTCAAATCTG AAGATTACAA CCGTAACAGA TTTGAAGATT GCAAAAAGTA TGATTGAGAA 2160

AGACTAGTAA AATGATTAAT CAAATTTATC AACTAACTAA GCCTAAGTTT ATCAATGTCA 2220

AATATCAGGA AGAGGCTATT GACCAAGAGA ATCATATCCT TATCCGTCCC AACTACATGG 2280

CTGTCTGTCA TGCGGATCAG CGTTACTATC AGGGAAAACG TGATCCCAAG ATTTTGAATA 2340

AAAAGCTTCC AATGGCAATG ATTCACGAGT CATGTGGAAC CGTCATTTCT GACCCGACCG 2400

GAACCTACGA GGTTGGTCAA AAAGTTGTCA TGATTCCCAA TCAGTCTCCT ATGCAGAGTG 2460

ATGAAGAATT CTATGAAAAC TACATGACAG GGACCCATTT CTTGTCTAGT GGATTTGATG 2520 

GCTTTATGAG AGAGTTTGTT TCTCTCCCTA AAGATCGTGT GGTGGCTTAT GATGCTATTG 2580

AAGATACGGT TGCAGCCATT ACAGAGTTTG TCAGTGTGGG CATGCACGCT ATGAATCGTC 2640

TATTGACTCT TGCTCATAGC AAGCGGGAGC GGATCGCCGT TATTGGAGAT GGAAGTTTAG 2700 

CTTTTGTGGT TGCCAATATT ATCAACTATA CTTTGCCAGA AGCAGAGATT GTGGTTATTG 2760

GTCGTCATTG GGAAAAGTTG GAACTCTTCT CATTTGCCAA AGAATGCTAT ATTACGGATA 2820

ATATTCCTGA AGATTTGGCC TTTGACCATG CTTTTGAA7G TTGTGGTGGT GATGGTACTG 2880

GACCAGCTAT TAATGACTTG ATTCGCTACA TTCGTCCTCA GGGAACGATT CTCATGATGG 2940 

GAGTTAGCGA ATATAAAGTC AATCTCAATA CTCGCGATGC CTTAGAAAAG GGCTTGATTT 3000

TGGTTGGGTC ATCTCGTTCT GGTCGCATTG ATTTTGAAAA TGCTATCCAA ATGATGGAAG 3060 

TCAAGAAATT TGCCAATCGT CTTAAAAATA TCCTTTATCT AGAAGAACCT GTAAGAGAAA 3120

TTAAAGATAT TCATCGTGTC TTTGCAACCG ATTTAAACAC AGCCTTTAAA ACAGTGTTTA 3180 

AGTGGGAAGT ATAAGTACTG GAGGTTAATT GTGGAGAAAA TCATTAAAGA AAAAATTTCT 3240

TCCTTACTTA GTCAAGAAGA GGAAGTCCTC AGTGTTGAAC AACTGGGTGG AATGACCAAT 3300 

CAAAACTATT TGGCCAAAAC AACAAATAAG CAATACATTG TTAAATTCTT TGGTAAAGGG 3360

ACAGAAAAGC TTATCAATCG ACAAGATGAA AAGTACAATC TTGAACTACT AAAGGATTTA 3420 

GGCTTAGATG TAAAAAATTA TCTTTTTGAT ATTGAAGCTG GTATCAAAGT AAATGAGTAT 3480

ATCGAATCTG CGATTACGCT TGATTCAACG TCAATCAAGA CCAAGTTCGA CAAAATTACT 3540 

CCAATATTAC AAACTATTCA TACGTCTGCT AAGGAATTAA GAGGAGAATT TGCTCCTTTT 3600

GAAGAAATCA AAAAATACGA ATCCTTGATT GAAGAACAAA TTCCTTATGC CAACTATGAA 3660

TCTGTTAGAA ATGCAGTCTT CTCCTTAGAG AAAAGACTGG CTGACTTAGG TGTTGACAGA 3720

AAATCTTGTC ATATCGATTT GGTGCCTGAA AACTTTATCG AATCACCTCA AGGACGACTT 3780 

TATTTGATTG ACTGGGAATA TTCATCAATG AATGATCCAA TGTGGGATTT GGCTGCCCTC 3840 

TTTTTAGAGT CTGAATTCAC TTCCCAAGAG GAAGAAACTT TCTTATCTCA CTATGAGAGT 3900 

GACCAAACAC CGGTTTCTCA TGAAAAGATT GCTATTTATA AAATTTTACA AGATACTATT 3960

TGGAGTCTAT GGACTGTCTA TAAGGAAGAG CAAGGTGAAG ATTTTGGTGA CTATGGTGTG 4020 

AATCGTTACC AAAGAGCTAT TAAAGGTTTG GCTTCTTATG GAGGTTCAGA TGAAAAGTAA 4080 

AAACGGAGTT CCTTTTGGCC TTCTCTCAGG TATTTTCTGG GGCTTGGGTC TAACGGTTAG 4140

TGCTTATATC TTTTCGATTT TTACAGATTT GTCACCCTTT GTGGTGGCTG CAACTCATGA 4200

TTTTTTGAGC ATCTTTATCT TACTAGCTTT TCTCTTGGTA AAAGAAGGGA AAGTTCGCCT 4260

CTCAATTTTC TTAAATATTC GCAATGTCAG TGTTATCATC GGAGCCTTGC TAGCAGGCCC 4320

TATCGGTATG CAGGCCAATC TTTATGCAGT TAAGTATATC GGAAGTTCTT TAGCTTCATC 4380

TGTATCGGCT ATTTACCCTG CGATTTCAGT TCTATTGGCT TTCTTCTTTT TGAAGCACAA 4440

GATTTCGAAA AATACTGTAT TTGGGATTGT CTTGATTATT GGAGGGATTA TTGCTCAGAC 4500 

CTATAAGGTT GAACAGGTTA ATTCTTTCTA CATTGGGATT CTTTGTGCTT TGGTTTGTGC 4560

TATTGCATGG GGAAGTGAGA GTGTTCTTAG CTCTTTTGCC ATGGAAAGTG AATTGAGTGA 4620

AATCGAAGCC CTCTTAATCC GTCAAGTAAC TTCGTTCTTG TCCTATCTTG TGATTGTGCT 4680 

CTTCTCTCAT CAGTCATTTA CTGCAGTAGC CAATGGACAA TTGCTAGGTC TCATGATTGT 4740 

TTTTGCAGCC TTTGATATGA TTTCCTACTT GGCTTATTAT ATCGCTATCA ATCGCTTGCA 4800 

ACCAGCCAAG GCTACAGGCT TGAACGTGAG CTATGTAGTA TGGACGGTCT TGTTTGCAGT 4860 

TGTTTTCTTG GGTGCACCGC TAGATATGCT GACCATTATG ACGTCACTTG TCGTCATTGC 4920 

TGGAGTTTAT ATTATTATTA AAGAATAAAG GAGATTCGTG TGAAAGCCAT TATCTTAGCA 4980

GCGGGATTGG GAACTCGCTT GCGTCCTATG ACTGAAAATA CCCCTAAAGC CTTGGTTCAG 5040 

GTTAATCAAA AACCTTTGAT TGAGTACCAA ATTGAGTTTC TCAAAGAAAA AGGAATCAAT 5100 

GACATCATCA TCATTGTTGG TTATCTTAAA GAACAATTCG ATTACTTGAA AGAGAAATAC 5160 

GGTGTTCGTC TCGTTTTCAA TGATAAATAC GCTGACTACA ATAACTTTTA CTCTCTCTAT 5220 

CTTGTAAAAG AAGAATTGGC CAACAGCTAT GTTATTGATG CTGACAATTA TCTCTTTAAA 5280 

AATATGTTCC GCAATGATTT GACACGTTCG ACTTATTTTA GTGTTTATCG TGAAGATTGT 5340 

ACCAACGAAT GGTTCTTGGT TTATGGAGAT GACTACAAGG TTCAAGACAT TATTGTTGAT 5400 

AGCAAGGCAG GTCGCATCCT TAGTGGTGTA TCCTTCTGGG ATGCTCCAAC TGCAGAAAAG 5460 

ATTGTCAGCT TTATCGACAA GGCTTATGTA AGTGGTGAAT TTGTTGATCT CTATTGGGAC 5520

AATATGGTTA AGGATAATAT CAAAGAGCTA GATGTCTATG TTGAAGAATT AGAAGGCAAT 5580 

AGCATTTATG AGATCGATAG TGTCCAAGAC TATCGTAAAT TAGAAGAAAT TCTTAAAAAC 5640 

GAAAATTAAA GATTCCAACA TCTGACAAAA TAGTCGGATG TTTTTTGATT TTTTACGAAC 5700 

TTTTACGAAT AGATAGATGA GTAGAAAAAG AAATGGAGTT ATTTATGAAA ATCACAAACT 5760

ATGAAATCTA TAAGTTAAAA AAATCAGGTT TGACCAATCA ACAGATTTTG AAAGTGCTAG 5820 

AATACGGTGA AAATGTTGAT CAGGAGCTTT TGTTGGGTGA TATTGCAGAT ATCTCAGGTT 5880 

GCCGTAATCC AGCCGTTTTT ATGGAACGTT ATTTTCAGAT AGACGATGCG CATTTGTCGA 5940

AAGAGTTTCA AAAATTTCCA TCTTTCTCTA TTTTAGATGA CTGTTATCCT TGGGATTTGA 6000

GTGAAATATA TGATGCGCCT GTACTTTTAT TTTACAAGGG AAATCTTGAC CTCCTGAAAT 6060 

TCCCGAAGGT AGCGGTCGTG GGCAGTCGTG CTTGTAGCAA ACAGGGAGCT AAGTCAGTTG 6120 

AAAAAGTCAT TCAAGGCTTG GAAAATGAAC TGGTTATTGT CAGTGGTCTG GCCAAGGGCA 6180 

TTGACACAGC AGCTCATATG GCAGCTCTTC AGAATGGCGG AAAAACCATT GCAGTGATTG 6240

GAACAGGACT GGATGTGTTT TATCCTAAAG CCAATAAACG CTTGCAAGAC TACATCGGCA 6300 

ATGACCATCT GGTTCTAAGT GAATATGGAC CTGGTGAACA ACCTCTGAAA TTTCATTTTC 6360

CTGCCCGTAA TCGCATCATT GCTGGACTTT GTCGTGGTGT GATTGTAGCA GAGGCTAAGA 6420 

TGCGTTCAGG TAGTCTCATT ACGTGTGAGC GAGCAATGGA AGAAGGACGC GATGTCTTTG 6480

CTATTCCTGG TAGCATTTTA GATGGACTAT CAGACGGTTG CCATCATTTG ATTCAAGAAG 6540 

GAGCAAAATT GGTCACCAGT GGGCAAGATG TTCTTGCGGA ATTTGAATTT TAAAAATGAC 6600 

CTAAGCTAGA ATTCTAAGAA AAAATCAATT TTAAGAGAAA ATGAACCCAA CATTTCCATA 6660

ATAAAACGCA TATTAGCAAG TTTTTAACAC TTGATAATAT GCGTTTTTTC TAAGTGGATT 6720

AGTAGAGTAG AGGATTTTTC TCATATAATA CTCTTCGAAA ATCTCTTCAA ACTACGTCAG 6780

CTTCCATCTG CAACCTCAAA ACAGTATTTT GAGCgaCTtC GTCAGTCTTA TCTACAACCT 6840

CAAAGCAGTG CTTTGAGCAA CCTGTGGCTA GCTTCCTAGT TTGCGCTTTG ATTTTCATTG 6900 

AGTATAAGGG AAAGTATAGT GAATTGAAAT AAGATGTGAA CAACTCTATC AGGAAAGTCA 6960

AATTAATTTA TAGAAATATT TTAGCAGCCA AGGTGTACTG TTATAGATTC AATTACACTA 7020 

TAATTTAGTG TAATTGAGAA AGGAGAAATG ATTGTGATTG ATGTTGGCTA GGTTATGTTC 7080

AATGATTCCT ACCGTCTCAA ATCTTGTCAG TAAGGAAAAA TAAATTCTTC AAAAGTAGAG 7140

ATTACAAGGC TTGTTTAAGA AAGAATTCAA AGACCTTGAC AAATAAAAAT AAAATGGTTA 7200 

TTATAAAAAA TGGTCTGAAA TAGATGATGA TACTTTTCGA AAATCTCTTC AAATACGTCA 7260

GCTCAGCTTT GCCTTGCTGT GTTTTGAGCA AGCTACGGTT AGCTTCCGAG TTTGATTT'.?C 7320

ATTTACTAGA AATGAAACTG ATGAGAGATA TCAGTAGACA TTTGAGTCAG GATATTATGG 7380

AAAATGATAA AAAGAGCTCG TGAGATTGGC ATATCAGACT ACTAAAGTAT TGAGTTTGTT 7440

AGGATTTTAG CGACTAGTTA GCTGGGAAAG GAAGATATTT GTGACAAATA ATAAACTGTA 7500

TTCGTTGATA GAATTTAGAA ATAAAATATA TGAAGAATTA GAACTTTCCA GAAGTGATTT 7560

AGCGATTTTA CTATGTGCCA TGCTTATCGC CTCTATCGGA TTAAATATGG ATTCGACTCC 7620 

CGTGATTATT GGAGCCATGT TAATCTCTCC TTTGATGACA CCTATTCTGG GAGTGGGGCT 7680 

CTCTCTAGCT ATATTTGATT TTAAATTGTT AAGAAAATCT TTTAAAATAT TAGCTATTCA 7740 
AATTCTTGCC AGTCTAATAG CTTCAACACT TTATTTTTAT CTTTCTCCCA TTTCGTATGC 7800 

TAGTTCGGAG ATTGTTGCTA GAACCTCTCC GACTATTTGG GATGTTCTCA TTGCTTTTGT 7860 

AGGAGGGATA GCAGGTATCA TTGGTGCTAG GAAAAAAGAG AC 7902 

(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18627 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

GAAGTTGAAA TGGCCAGCTG ATGAGCAATA TCGGTCATAG AAATCTTCTC AATCAACTTT 60 

TGCGCAATTT TTTGGTTGAT AATACGAGGA ATTTGGTGAT TTTTCTTGAC GATAGAAGTT 120 

TCAGCGACCA TCATTTTTGA ACAGTGATAG CACTTGAAAC GACGCTTTCT AAGTAGAATT 180 

CTAGTAGGCA TACCAGTTGT CTCAAGGTAA GGAATCTTAG ACGGTTTTTG AAAGTCATAT 240 

TTCTTCAATT GGTTTCCGCA CTCAGGGCAA GATGGGGCGT CGTAGTCCAG TTTGGCGATG 300 

ATTTCCTTGT GTGTATCTTT ATTGATGATG TCTAAAATCT GGATATTAGG GTCTTTAATG 360 

TCTAGTAATT TTGTGATAAA ATGTAATTGT TCCATATGAA TCTTTCTAAT GAGTTGTTTG 420 

GTCGCTTTTC ATTATAGGTC ATATGGGACT TTTTTTCTAC AATAAAATAG GCTCCATAAT 480 

ATCTATAAGG GATTTACCCA CTACAAATAT TATAGAGCCA AAAATCCTTT GTTTACTAAA 540 

CAAGGGATTT TTCTTTTGTC TCTGCTCCTT TTTTGATATA ATAGTTCTAT GTTAAAATCA 600 

GAAAAACAAT CACGTTATCA AATGTTAAAT GAAGAATTGT CCTTCCTATT GGAAGGCGAA 660 

ACCAATGTTT TGGCTAATCT TTCCAACGCC AGTGCTCTCA TAAAATCACG TTTTCCTAAT 720 

ACCGTATTTG CAGGCTTTTA TTTGTTCGAT GGAAAGGAAT TGGTTTTAGG CCCCTTCCAA 780 

GGAGGTGTTT CCTGCATCCG TATTGCACTA GGCAAGGGTG TTTGTGGTGA GGCAGCTCAC 840 

TTTCAGGAAA CTGTTATTGT TGGAGATGTG ACGACCTATC TCAACTATAT TTCTTGTGAT 900 

AGTCTAGCTA AAAGTGAAAT TGTGGTGCCG ATGATGAAGA ATGGTCAGTT ACTTGGAGTT 960 

CTGGATCTGG ATTCTTCAGA GATTGAGGAT TACGATGCTA TGGATCGAGA TTATTTGGAA 1020 

CAATTTGTCG CTATTTTGCT TGAAAAGACA GCATGGGACT TTACGATGTT TGAGGAAAAA 1080 

TCTTAATGTA TCAAGCACTT TATCGAAAAT ATAGAAGTCA AAACTTCTCC CAGTTAGTTG 1140 

GTCAAGAAGT TGTGGCTAAG ACTCTTAAAC AAGCGGTGGA GCAAGAGAAA ATAAGTCACG 1200 
CTTATCTTTT TTCTGGTCCT CGTGGAACGG GAAAAACCAG TGTTGCTAAA ATCTTTGCCA 1260 

AGGCTATGAA CTGTCCCAAT CAAGTGGGTG GCGAACCTTG CAATAACTGC TATATTTGTC 1320 

AAGCAGTGAC GGACGGTAGT TTAGAAGATG TCATTGAAAT GGATGCAGCT TCTAATAATG 1380 

GGGTAGATGA AATTCGCGAA ATTCGTGATA AATCTACCTA TGCGCCTAGC CTTGCTCGTT 1440 

ATAAGGTTTA TATCATAGAT GAGGTTCACA TGCTGTCTAC AGGGGCTCTT AATGCCCTCC 1500 

TAAAGACGCT GGAAGAACCA ACACAGAATG TAGTCTTTAT TTTGGCCACT ACTGAATTGC 1560 

ACAAGATTCC TGCTACTATT CTATCCCGTG TGCAACGTTT TGAGTTTAAA TCAATTAAGA 1620 

CACAGGATAT TAAGGAACAT ATTCACTATA TCTTAGAAAA AGAAAATATC AGTTCTGAAC 1680 

CAGAGGCTGT GGAAATCATT GCCAGACGGG CGGAAGGTGG AATGCGGGAC GCCTTGTCTA 1740 

TTTTGGATCA AGCCCTGAGT TTGACACAGG GAAATGAGCT GACGACTGCT ATCTCTGAAG 1800 

AAATTACTGG CACCATTAGC CTATCAGCCT TGGATGATTA TGTGGCGGCC TTGTCTCAAC 1860 

AGGATGTTCC CAAAGCTTTG TCTTGCTTGA ATCTTCTTTT TGACAATGGT AAGAGCATGA 1920 

CTCGTTTTGT GACCGATCTT TTGCACTATT TAAGAGACTT GTTAATTGTT CAAACAGGGG 1980 

GAGCAAATAC TCATCATAGT TCAGTCTTTG TAGAAAATTT GGCACTTCCT CAAAAAAATC 2040 

TGTTTGAAAT GATTCGCTTA GCAACAGTGA GTTTAGCAGA TATTAAGTCT AGTTTGCAAC 2100 

CCAAGATTTA TGCTGAAATG ATGACCGTCC GTTTGGCGGA AATCAAGTCC GAACCAGCTC 2160 

TATCAGGAGC GGTTGAAAAT GAAATTGCTA CGCTGAGACA GGAAGTTGCC CGTCTCAAAC 2220 

AAGAGCTTTC TAATGTAGGT GCGGTTCCTA AACAAGTTGC ACCAGCTCCT AGTCGACCAG 2280 

CTACGGGCAA AACAGTCTAT CGTGTCGATC GCAATAAAGT GCAATCTATC TTACAAGAGG 2340 

CCGTCGAAAA TCCTGATTTA GCACGTCAAA ATTTAATTCG TTTGCAGAAT GCCTGGGGAG 2400 

AGGTAATTGA AAGTCTAGGT GGGCCGGACA AGGCTCTGCT AGTTGGTTCT CAACCGGTTG 2460 

CTGCCAATGA ACACCATGCT ATTCTTGCTT TTGAGTCTAA CTTCAATGCT GGTCAAACTA 2520 

TGAAACGAGA CAATCTCAAT ACCATGTTTG GTAATATCCT CAGTCAGGCG GCAGGTTTTT 2580 

CACCTGAGAT TTTAGCTATT TCCATGGAGG AATGGAAAGA AGTTCGCGCA GCCTTTTCAG 2640 

CCAAAGCCAA ATCTTCTCAA ACTGAAAAAG AAGTAGAAGA AAGCCTGATT CCAGAAGGAT 2700 

TTGAATTTTT GGCTGATAAA GTGAAGGTAG AGGAAGACTA AAGAAAGATT TCATGATACA 2760 

ATAAGTTTAT GAATAAACAA CAATTTATTA TTATGGCGCT GTTTACAGCT GCTGAGACCT 2820 

ATTTTTTCAA TGAAGCCTGG ATGACTGGCC GCTATATTAT GGCAGCCTTT TGGGCAATTT 2880 

TACTCTTTAG AAATTTCCGA GTCAGTTATG TGATGGGCAA AATCGTTGAT GTCATCGATC 2940 

AGCATTTTAA TAGGAAAGAC TAGCCCTCAG CTTCCAGACA AAATCAAAGC CTTTTAGGCT 3000 
TTTTTTTGTT ATACTAGAAA AGTATATTTA TAGAATTTTT GCTCTATTTC TGGGGAAATC 3060 

AGACGTTTTT CTAGTAAGTA CTGTAAAAGT TTTGAAAAAG AAAGGAACTA TCATGTCAGT 3120 

ATTAGAGATC AAAGATCTTC ACGTTGAGAT TGAAGGAAAA GAAATTTTAA AAGGGGTTAA 3180 

CCTGACCCTG AAAACAGGAG AAATTGCCGC TATCATGGGA CCAAATGGTA CAGGTAAATC 3240 

GACTCTTTCT GCCGCTATCA TGGGAAATCC AAACTATGAA GTAACTAAAG GTGAAGTTTT 3300 

GTTTGATGGC GTAAACATCC TTGAGTTGGA AGTGGATGAG CGTGCGCGTA TGGGACTTTT 3360 

CCTTGCTATG CAATACCCAT CAGAAATCCC TGGAATTACC AATGCTGAGT TTCTTCGTGC 3420 

CGCTATGAAT GCGGGTAAAG AAGATGATGA GAAGATTTCA GTTCGTGAGT TTATTACTAA 3480 

GCTAGATGAA AAAATGGAAT TGCTCAACAT GAAAGAAGAA ATGGCAGAGC GTTACCTCAA 3540 

CGAAGGCTTC TCTGGTGGTG AGAAAAAACG CAATGAAATT CTTCAACTTT TGATGTTGGA 3600 

GCCAACATTT GCTCTTTTGG ACGAGATTGA CTCAGGTCTT GATATTGACG CTCTTAAAGT 3660 

TGTGTCTAAA GGTGTCAATG CCATGCGTGG TGAAGCTTTT GGTGCTATGA TCATCACTCA 3720 

CTACCAACGT CTTTTGAACT ATATCACACC TGATGTGGTA CACGTGATGA TGGAAGGTCG 3780 

TGTTGTCCTT TCTGGTGGTC CAGAATTGGC TGCGCGTTTG GAACGTGAAG GATACGCAAA 3840 

ATTAGCTGAA GAACTTGGCT ACGACTACAA GGAAGAATTG TAATTCCCTC GTATCTTTTA 3900 

GGAGAAGTAA ATGACTAGAG AAAATATTAA ACTTTTTTCA GAAATGCACG CTGAACCAAG 3960 

CTGGTTGGCT GATCTCCGTC AAAAAGCTTT TGACAAGATT GAGACTTTGG AATTACCAGT 4020 

TATTGAGTGT GTCAAATTCC ACCGTTGGAA TCTGGGTGAT GGAACGATTA CAGAAAATGA 4080 

GCCATCAGCA AATGTTCCAG ATTTCACAGC TTTAGATCAT CACTTGAAGT TGGTGCAAGT 4140 

AGGAACTCAA ACTGTTTTCG AACAAACTCC AGTTGAGTTA GCTGAACAGG GTGTTGTCTT 4200 

CACAGACTTT CACTCAGCTT TAGAAGAAAT TCCAGAGCTG ATCGAAGAAT TCTTCATGTC 4260 

ATCTGTTAAG TATGATGATG ACAAGTTGGC GGCTTACCAC ACAGCTTACT TTAACAGTGG 4320 

TGCTGTACTC TATATTCCAG ATAACGTAGA AATCACAGAG CCAATTGAAG GAATTTTCTA 4380 

CCAAGATAGC GATAGCAATG TGCCGTTTAA CAAGCATATT ATGATTATCG TTGGTAAAAA 4440 

TTCTAAGATT AGTTATCTGG AGCGTTTAGA GTCACGCGGT GAAGGAAGTG ACAAAGCAAC 4500 

TGCCAATATC ACAGTGGAAG TGATTGCACG TTCTGGTGCG CAAGTCAAGT TTGCTGCTAT 4560 

CGACCGTCTA GGTGAAAACG TCACTGCCTA CATTAGCCGT CGTGGTAAAT TAGGCAACGA 4620 

TGCAAGTATT GACTGGGCTA TCGGTGTCAT GAACGAAGGA AATGTCGTTG CTGATTTTGA 4680 

TAGTGACTTG ATTGGTAATG GTAGCCATGC TGACCTCAAG GTTGTAGCTC TTTCAAGTGG 4740 
TCGTCAGGTA CAAGGGATTG ATACTCGTGT AACTAACTAT GGCTGCAACT CAATCGGAAA 4800 

CATTCTACAA CATGGGGTTA TCCTTGAAAA AGCAACTTTG ACTTTCAATG GTATCGGCCA 4860 

CATCATCAAG GGTGCTAAGG GAGCAGATGC GCAACAAGAG AGCCGTGTTC TCATGCTTTC 4920 

AGACCAAGCG CGTTCAGATG CTAACCCAAT TCTTTTGATT GATGAAAATG ACGTAACTGC 4980 

AGGCCATGCA GCCTCTATTG GTCAGGTAGA TCCAGAAGAT ATGTACTACC TCATGAGTCG 5040 

TGGCTTGGAT AAGGCAACTG CAGAGCGTTT GGTTGTTCGT GGTTTCCTTG GATCTGTTAT 5100 

CGTGGAGATT CCAGTCAAGG AAGTTCGTGA TGAAATGATT GCAACTATCG AAGAGAAATT 5160 

GTCAAAACGC TAAGGGGCAG CCTATGTTAG ATGTAGAAGC GATTCGCAAG GATTTTCCAA 5220 

TTTTAGATCA GATTGTCAAT GATGAACCTC TGGTCTATCT GGACAATGCT GCGACGACAC 5280 

AAAAACCACT AGTAGTTCTG AAAGCTATTA ACAGCTACTA TGAGCAGGAC AATGCCAATG 5340 

TTCACCGTGG TGTCCATACC TTAGCGGAAC GAGCGACAGC TTCTTATGAA GCTGCTCGTG 5400 

AAACCATTCG TAAGTTTATT AATGCAGGCT CTACAAAGGA AGTTCTCTTT ACCAGAGGAA 5460 

CGACAACCAG CCTTAACTGG GTGGCACGCT TTGCTGAGGA AATTCTCACT GAGGGAGACC 5520 

AGGTCTTGAT TTCAGTAATG GAACACCATT CTAATATCAT TCCATGGCAG GAAGCTTGTC 5580 

GAAAGACTGG AGCAGAGCTT GTCTATGTCT ATCTTAAAGA CGGTGCCTTG GATATGGAGG 5640 

ATTTGCGAGC TAAATTGACT GATAAGGTTA AATTTGTTTC CCTAGCTCAT GCCTCCAATG 5700 

TTCTTGGTGT GGTCAATCCG ATCAAGGAAA TCACTCAATT AGCCCACCAA GTTGGGGCAA 5760 

TTATGGTAGT GGATGGTGCT CAATCTACAC CTCATATGAA GATTGATGTC CAGGACTTGG 5820 

ATCTGGACTT TTTCGCCTTT TCGGGTCACA AGATGGCTGG TCCGACTGGT ATCGGTGTCC 5880 

TTTACGGCAA AGAAAAGTAT CTTGAGCAAA TGTCTCCAGT AGAATTTGGC GGCGAGATGA 5940 

TTGATTTTGT CTACGAGCAA TTTGCTAGTT GGAAGGAATT GCCTTGGAAA TTTGAGGCTG 6000 

GAACGCCAAA TATGGCAGGA GCTATTGGAC TTGCGACTGC AGTTGATTAT CTGGAAAAGA 6060 

TTGGTATGGA TGCCGTTGAA GCTCATGAAC AGGAATTGAT TGCGTACGTC TATCCAAAAC 6120 

TGCAGGCAAT TGAGGGATTG ACCATTTACG GTTCTCAGGA TTTGGCTCAA CGTTCGGGTG 6180 

TTATTGCCTT TAACCTAGGT GATCTCCATC CTCACGATCT TGCGACGGCT CTGGATTATG 6240 

AAGGAGTGGC TGTTCGTGCT GGTCACCATT GTGCGCAACC CTTGCTTCAG TATTTGGAAG 6300 

TCCCAGCAAC AGCTCGTGCA AGTTTTTATA TCTACAATAC CAAGGCAGAT TGCGACAAAC 6360 

TAGTCGATGC CCTACAAAAG ACAAAGGAGT TTTTCAATGG CACTTTCTAA ACTAGATAGC 6420 

CTTTATATGG CAGTGGTAGC AGACCATTCG AAAAATCCAC ATCACCAAGG GAAGTTAGAA 6480 

GATGCTGAGC AAATCAGTCT CAACAATCCG ACTTGTGGGG ATGTCATCAA CCTCTCTGTC 6540 
AAGTTTGATG CAGAGGACCG TTTGGAAGAT ATTGCTTTTC TAAATTCAGG ATGCACGATT 6600 

TCAACTGCTT CTGCTAGTAT GATGACAGAT GCCGTTTTAG GAAAAACCAA ACAAGAAATT 6660 

TTAGAACTGG CGACTATTTT TTCTGAAATG GTTCAAGGGC AAAAAGATGA GCGTCAAGAC 6720 

CAACTTGGAG ACGCGGCATT CTTGTCAGGT GTTGCCAAAT TCCCTCAAAG AATCAAGTGT 6780 

GCAACCCTAG CTTGGAATGC CCTTAAGAAA ACAATTGAAA ATCAAGAAAA ACAGTAAGAC 6840 

AAGTTTCTTT TGTCTTATGA ATTATTAGAA ATGAAGAAAG AAAGGATACT ATGGCTGAAG 6900 

AAAGAGTAGA ACCAAAACCA ATTGACCTTG GTGAATATAA ATTTGGTTTC CATGACGATG 6960 

TAGAGCCTGT CTTATCGACA GGAAAAGGAC TCAACGAAGG TGTTATTCGT GAATTATCTG 7020 

CTGCTAAGGG TGAGCCTGAG TGGATGTTGG AGTTCCGTTT GAAGTCTTAT GAAACCTTCA 7080 

AAAAAATGCC CATGCAAACT TGGGGAGCAG ACTTGTCAGA GATTGACTTT GATGACTTAA 7140 

TCTACTACCA AAAACCATCT GACAAACCAG CCCGTTCTTG GGATGATGTA CCTGAAAAGA 7200 

TTAAAGAAAC CTTTGAACGT ATCGGGATTC CAGAAGCTGA ACGTGCTTAT TTAGCAGGGG 7260 

CTTCTGCCCA GTACGAGTCA GAAGTGGTTT ACCACAACAT GAAGGAAGAG TTCCAAAAAT 7320 

TAGGTATTAT CTTTACAGAT ACAGATTCCG CACTCAAGGA ATACCCAGAC TTATTTAAAC 7380 

AATACTTTGC GAAGTTGGTA CCGCCGACAG ATAACAAGTT GGCAGCCCTC AACTCAGCAG 7440 

TATGGTCGGG TGGAACTTTT ATCTACGTGC CAAAAGGTGT CAAGGTAGAT ATTCCACTTC 7500 

AAACTTATTT CCGTATCAAT AACGAAAATA TAGGTCAGTT CGAACGTACC TTGATTATCG 7560 

TTGATGAGGG AGCAAGCGTC TACTACGTAG AAGGATGTAC AGCACCAACA TATTCAAGCA 7620 

ATAGCTTACA CGCTGCCATT GTAGAAATTT TTGCTTTGGA CGGAGCTTAT ATGCGTTATA 7680 

CAACTATCCA AAACTGGTCT GATAACGTCT ATAACTTGGT AACAAAGCGT GCTAAGGCTC 7740 

AAAAGGATGC CACTGTTGAG TGGATTGATG GAAACTTGGG TGCCAAAACG ACTATGAAAT 7800 

ATCCATCTGT TTACCTTGAT GGAGAAGGAG CGCGTGGTAC CATGCTCTCT ATCGCCTTTG 7860 

CTAATGCAGG GCAACACCAA GACACGGGTG CTAAGATGAT TCACAATGCT CCACATACCA 7920 

GCTCGTCTAT TGTGTCTAAA TCCATCGCTA AAGGTGGAGG AAAGGTTGAC TACCGTGGAC 7980 

AAGTCACCTT TAACAAGAAC TCTAAGAAAT CTGTTTCCCA CATTGAATGT GATACCATTA 8040 

TCATGGATGA CTTGTCAGCA TCAGATACTA TTCCATTTAA TGAAATTCAC AACTCGCAAG 8100 

TGGCTTTGGA ACACGAAGCC AAAGTATCTA AGATTTCAGA AGAGCAATTG TATTATCTCA 8160 

TGAGCCGTGG ATTGTCAGAA TCTGAGGCAA CTGAAATGAT TGTCATGGGA TTTGTAGAAC 8220 

CCTTTACAAA AGAACTTCCA ATGGAATACG CAGTTGAGCT GAACCGCTTG ATTAGCTATG 8280 
AAATGGAGGG ATCAGTTGGA TAAAATTTGA TTTTATACTC TTCGAAAATC TCTTCAAACC 8340 

ACGTCAGCAT CGCCTTACCG TATGTATGGT TwCTGAtTCG TCAGTTTCAT CTACAACCTC 8400 

AAAACAGTGT TTTGAGCAAC tGCGGCTAGC TTCCTAGTTT GTTCTTTGAT TTTGAGTATT 8460 

AGATTTACTC AAAATCAAGG ATTTTGAAGA TGAACTTGTA TCAAAAAATC GCGGTTTAAA 8520 

ATCGCGATTT TTTATAATTT CTCGTTAACA AAGCGGACAA ACTGATTCCA CCAAACTTTT 8580 

AAGAAGAAGG CTTTTTCAAT TTTCTTGTCT GCTACCATTT CGAAACTAGG GCGCTCTGTG 8640 

GTGATGTAAC CTTGACCAAT CAAGTCCTTG TCTTCATAAG TCAAATGGCC AACCACTGTT 8700 

CCAGCTTCAA GTGGTGCTGG GATTGCTTTG GAATCAGGTG TGAATTGAAC AGATTGGGAA 8760 

GATTGATTCC CAACACGTTC GATTAGATAG ATATCCTCTG GAGCCACTGC AGTTACTGTA 8820 

TCTTCTTTTC CATCTTGTAC AGGGGCTTTG CTATCTTGAT AGGCATCGCC TTGTTGAACG 8880 

ATTTTGCGAA GTGTAAATGT AGAAGAAATA TAATCCATTA GGGAAGATGT AGCTGTAAAT 8940 

CGAGCGTAAG GATTATTGTC TTGATGATCT GCATTTAAAA CAACTGTGAT GACTCTCATG 9000 

CCTTTTTCGA CAGTAGTACC AACAAAAGAC TCTCCAGCCT TATCTGTTGT TCCTGTTTTT 9060 

AGCCCATCAA AACCACCACG GTAAGCAGGC ATACCTTCTA ACATGTAGTT GGTTGAAGTG 9120 

ATTGTCATCC CAGCAAAAGT AGAAGAAGGT TTTTTGGTGA TTTCTAAGAC TTGTGGGTAT 9180 

TTTTTGATGA GGTTGCGAGC AACGATAGCG ACATCATAAG CACTAAGCTT ATTTTCCTCA 9240 

TCTTTTTTAG AACCTGGGTA AATGTTATCC CCTAGAGTTT CATTGTTAAG ACCTGTCGTA 9300 

TTGACAACAG TGGCATCCTG AATTCCCCAT TCCAAGAGTT TTGCCCGCAT CATATCGACG 9360 

AAATCTTTTT CTGAGCCAGC AATTTTCTCA GCTAGGGCAA TAGCGGCGCT GTTGGCACTA 9420 

GATACCAGAG TTGCTTCAAG CAACTCTTCG ACAGTATAAT TACGGGCCTC CATAGGAATA 9480 

TTACTGGCTT CAGAATTTGT CGTCAATTGA TAAGGATAAT CAGAAATATC TACAGGAGTG 9540 

GAGAGGGTAA TACTTCCGTT TTCCAAAGCT TCATAGACCA GATAAACAGT AATCAATTTT 9600 

GTTATGGAAG CAATTTCGAC AGGTTGCGTT GCATCCTTCT CATAGAGAAT TTTACCAGTA 9660 

TTTGCCTCAA CAGCAATCGC ATGTTTAGCG GCAATGGTAA AATCTTGAGC AACAGCAGTA 9720 

GAAGCACCCC CTAAAAGAGA GACAGTTAAC AAAGTTAAAA ATATTTTTTT CATAGTAGTC 9780 

TTATTCTATC ATAAAGAAAA AAAATATTCT TGCTTTAATA ATTCATCTGT TAAGCTTTTT 9840 

GAAAATATGG TAAAATAAAG TAAGGGAGGT AACTCATGTT TCGTAGAAAT AAATTATTTT 9900 

TTTGGACCAC AGAAATTTTA CTCTTAACCA TCATCTTTTA CCTATGGAGA CAGATGGGGT 9960 

CTTTGATTAA CCCTTTTGTT AGCGTGCTTA ATACAATTAT GATTCCATTT TTATTAGGGG 10020 

GCTTTTTTTA TTATTTGACA AACCCTATTG TTACTTTCTT AAATAAAGTC TGTAAACTCA 10080 
ATCGTTTGCT TGGTATTTTA ATTACCTTGT GTACTTTGGT CTGGGGAATG GTCATAGGTG 10140 

TTGTCTATCT CTTACCTATT TTGATTAATC AGTTATCTAG TTTGATTATA TCTAGTCAAA 10200 

CTATTTATAG TCGAGTACAA GACTTAATCA TAGACTTATC TAATTATCCT GCGCTCCAGA 10260 

ATTTGGATGT AGAAGCTACA ATTCAGCAGT TAAACTTATC CTATGTTGAT ATTCTTCAAA 10320 

ATATCCTAAA TAGCGTATCA AATAGTGTGG GGAGCGTCTT GTCAGCTCTT ATCAGTACTG 10380 

TTTTGATTTT GATTATGACT CCAGTTTTTT TGGTTTATTT CTTATTAGAT GGACATAAAT 10440 

TCTTGCCCAT GCTTGAAAGA ACGATTCTAA AGAGGGATCG CTTGCATATT GCAGGCTTAT 10500 

TAAAGAATTT AAATGCGACG ATTGCTCGCT ATATTAGTGG AGTTTCGATT GACGCAATCA 10560 

TTATAGGTTG TTTGGCTTAT ATTGGCTATA GTATTATTGG TTTAAAATAT GCTTTAGTTT 10620 

TTGCCATTTT TTCTGGTGTA GCCAATTTAA TTCCTTATGT GGGGCCAAGT ATTGGTTTGA 10680 

TTCCTATGAT CATCGCAAAT ATATTCACTG ATCCCCATAG ACTGCTGATT GCAGTGATTT 10740 

ATATGCTTGT TGTTCAGCAG GTAGATGGCA ATATCTTATA TCCTCGAATC GTAGGAAGTG 10800 

TTATGAAGGT TCATCCAATC ACGATTTTAG TTTTACTTTT GTTGTCAAGC AATATCTATG 10860 

GTGTAGTTGG AATGATTGTC GCAGTGCCAA CCTATTCTAT CTTGAAAGAA ATTTCTAAGT 10920 

TCTTATCCCA TTTGTATGAA AATCATAAAA TAATGAAAGA ACGAGAAAGA GAATTAGCTA 10980 

AGTAAAAGTC AGGAGAACCC TGATTTTTCT TTACTGGAAG TGGCCTTTAG ATTAGAAGAC 11040 

TGAAAATAAG TTAAAGTCTT AAACTAATTT TCACAGCTAA GAATAGTAGA AGTTAATCTG 11100 

ATAAAAATCG AAAAAACCAG TGGAATTCTG TGTCAGGGTA AGTTCCACTG GTTTTCATAG 11160 

TCTATTAAAG TTCGAATGAA ACCTATTTAT AGTAGATTGA AACTAGAATA GTACACCTCT 11220 

AATTCTAAAA CATTGTTAGA AATCGATTTG ACTGTCCTGA TCTATTCGTT CTATTCTTAT 11280 

TTCATTTTAC TATATTTTGG TGCAATAAGT GAAAAGTAGT CCGAATAATA TAAGGATTGA 11340 

TTTTATAGTT TTTAAACTCA AATGAATTGA AATAAAGAGA GTACGAAAAT TCTCATCTGA 11400 

AAGTATTTTA GAATAATTCT CTTCGTGAAT TTCTTCAAAA CAGATAGCTT CATCTTAGGT 11460 

ATGTGATTTC TTTTTGCATT TTTGAGTTAG ATAAGGTATA ATGATTTTAT TGTCTTTTGG 11520 

GGTCGTTACG GATTCGACAG GCATTATGAG GCATATTTTG CGACTCGTGT GGCGACGTAA 11580 

ACGCTCAGTT AAATATAACT GCAAAAAATA ACACTTCTTA CGCTCTAGCT GCCTAAAAAC 11640 

CAGCAGGCGT GACCCGATTT GGATTGCTCG TGTTCAATGA CAGGTCTTAT TATTAGCGAG 11700 

ATACGATTAA GCCTTGTCTA GCGGTTTGAT AAGAGATTGA TAGACTCGCA GTTTCTAGAC 11760 

TTGAGTTATG TGTCGAGGGG CTGTTAAAAT AATACATAAC CTATGGTTGT AGACAAATAT 11820 
GTTGGCAGGT GTTTGGACGT GGGTTCGACT CCCACCGGCT CCATTATTCC TTTGCATTCT 11880 

TTTGCATTCC TTGGTAAAAC GTTGTTAAAT CAACGTTTTT TATTTTTATC TTTGGTATTC 11940 

CTTTGCATTC TTTTGCTAAA AAGGGAGTCA CAAACAGACC CTATTTTAAA AAAGGATAGA 12000 

AAAAAGGATA CAACATTTGT CGCATCCTAA AAATAATCTT TTTTCGACGG AAGACATGGG 12060 

ATTCGAACCC ACGCACGCTA TTACACGCCT ACCGCGTTTC CAACACGGCC TCTTAAGCCT 12120 

CTTGAGTAAT CTTCCAATAC TTACTCAAAT AGTCTACCAT AAAGGCTCTT ATCTTGCAAT 12180 

AAAAATTCTA GAAATAAGAA AAATGATAGA TTTTGAAAGA AAATGATAAA AAATGCTTGA 12240 

CTTCGAAAGA AAGTATGATA GAATGAATAG TGTAAACGAT AACAGGAGGT GATTCAGTGT 12300 

TAAAAACAGA ACGTAAACAA CTAATTTTAG AGGAGTTAAA TCAACATCAT GTAGTTTCTC 12360 

TAGAAAAATT AGTTAGTTTG CTAGAAACGT CAGAATCAAC GGTTCGAAGA GACTTGGATG 12420 

AGTTGGAAGC GGAAAACAAG CTTCGTCGTG TGCATGGTGG AGCAGAACTC CCCTACTCCT 12480 

TACAGGAAGA AGAAACCATT CAAGAAAAAT CTGTCAAAAA CCTTCAAGAA AAGAAATTGC 12540 

TGGCTCAGAA AGCAGCCTCT CTCATTAAAG AAAAAGATGT CATCTTTATC GATGCTGGAA 12600 

CAACAACTGC TTTTTTGATT CATGAATTGG TCAATAAGAA TGTTACAGTT GTGACCAACT 12660 

CCATTCACCA TGCCGCTCAG TTGGTTGAAA AGCAGAwTCC AACTGTCATG GTTGGAGGAA 12720 

ACGTCAAGAC GGCGACAGAT GCTAGTATCG GGGGCGTTGC TCTTAACCAG ATTAACCAAT 12780 

TGCACTTTGA CCGTGCCTTT ATCGGAATAA ATGGTGTTGA CGATGGCTAT TATACGACTC 12840 

CTGATATGGA GGAGGGAGCT GTGAAAAGAG CTATTTTGGA GAATGCCAAG CAGACCTACG 12900 

TCTTGGTGGA TTCGTCAAAA ATTGGACAAA CTTGCTTTGC CAAGGTAGCC CCACTCAAAC 12960 

GCGCTATCGT TATCACTAGT CAAGGGCATG AGCTCTTGCA GGTTATTAAG GAGAAAACGG 13020 

AGGTAATAGA AGTATGATTT ATACAGTCAC ACTCAATCCA TCCATTGACT ATATCGTTCG 13080 

TTTGGACCAA GTCAAAGTTG GTAGTGTCAA TCGTATGGAC AGTGATGATA AGTTTGCTGG 13140 

TGGGAAAGGA ATCAATGTCA GCCGTGTCTT GAAACGTTTG AATATACCAA ATACAGCGAC 13200 

GGGATTTATC GGTGGCTTTA CTGGTAAATT TATCACAGAT ACTTTAGCAG AGGAAGAAAT 13260 

CGAGACACGT TTTGTCCAGG TGGCAGAAGA TACTCGTATC AATGTTAAAA TCAAAGCAGA 13320 

CCAAGAAACA GAAATCAACG GAACGGGTCC AACTGTTGAA TCGGTTCAGC TAGAAGAATT 13380 

GAAAGCTATT TTATCTAGTC TGACAGCAGA AGATACAGTT GTCTTTGCAG GTTCAAGTGC 13440 

TAAAAATCTA GGCAATGTTA TCTATAAGGA TTTGATTTCC TTGACGCGCC AGACTGGTGC 13500 

GCAAGTGGTC TGTGACTTTG AAGGACAGAC CTTAATTGAT AGTTTGGACT ACCAGCCTCT 13560 

TCTTGTAAAA CCAAACAATC ATGAACTTGG AGCGATTTTT GGGGTTAAAC TCGAAAGTTT 13620 
AGATGAAATT GAGAAATACG CTCGTGAGTT ACTGGCTAAG GGTGCTCAAA ATGTTATTAT 13680 

CTCTATGGCT GGTGATGGTG CCCTTCTTGT CACATCTGAG GGAGCTTACT TCGCTAAACC 13740 

AATCAAAGGA ACAGTCAAAA ATTCAGTTGG AGCTGGTGAT TCTATGGTTG CTGGATTCAC 13800 

AGGTGAATTT GTCAAATCAA AAGACGTAGT AGAAGCCTTC AAATGGGGAG TGGCTTGCGG 13860 

AACGGCAACT ACCTTCTCAG ATGACTTGGC AACGGCGGAA TTTATTAAAG AAACATATGG 13920 

AAAAGTTGAG GTAGAAAAAC GATGAAAATT CAAGACCTAT TGAGAAAAGA TGTCATGTTG 13980 

CTAGATTTGC AGGCAACTGA AAAAACAGCT GTCATCGACG AGATGATTAA AAATTTGACA 14040 

GACCACGGTT ATGTAACAGA TTTTGAAACA TTTAAAGAAG GAATTTTGGC GCGTGAAGCT 14100 

TTGACTTCTA CTGGTTTGGG TGATGGAATC GCAATGCCTC ACAGCAAAAA CGCTGCTGTC 14160 

AAAGAAGCGA CAGTTCTATT TGCTAAGTCA AATAAGGGTG TTGACTACGA GAGCTTGGAT 14220 

GGACAAGCAA CTGACCTCTT CTTCATGATT GCAGCTCCAG AAGGTGCCAA TGATACTCAC 14280 

TTGGCAGCCT TGGCAGAATT GTCTCAATAC TTGATGAAAG ACGGTTTTGC AGACAAACTT 14340 

CGTCAAGCAA CATCTGCAGA CCAAGTTATC GAACTTTTTG ACCAAGCTTC AGAAAAAACT 14400 

GAGGAACTTG TTCAAGCACC TGCTAATGAC TCTGGTGACT TTATCGTAGC TGTTACAGCT 14460 

TGTACAACAG GTATTGCCCA CACTTACATG GCCCAAGAAG CCCTTCAAAA AGTAGCTGCT 14520 

GAAATGGGGG TTGGTATCAA GGTCGAAACC AACGGTGCTA GCGGTGTTGG AAATCAACTA 14580 

ACTGCAGAAG ATATCCGTAA GGCTAAAGCT ATTATCATTG CAGCAGACAA GGCCGTTGAA 14640 

ATGGATCGAT TTGATGGAAA ACCATTGATC AATCGTCCAG TTGCTGACGG TATCCGTAAG 14700 

ACAGAAGAGC TAATTAACTT GGCTCTTTCA GGAGATACTG AAGTCTACCG TGCCGCTAAT 14760 

GGTGCCAAAG CTGCAACAGC CTCTAACGAA AAACAAAGCC TTGGTGGTGC CTTGTACAAA 14820 

CACTTGATGA GTGGTGTATC TCAAATGTTA CCATTCGTTA TCGGTGGTGG TATCATGATT 14880 

GCCCTTGCCT TCTTGATTGA CGGTGCTTTG GGTGTTCCAA ATGAAAACCT TGGCAATCTT 14940 

GGTTCTTACC ATGAGTTAGC TTCTATGTTC ATGAAAATTG GTGGAGCTGC CTTTGGTTTG 15000 

ATGCTTCCAG TCTTTGCGGG TTATGTTGCC TACTCTATTG CTGAAAAACC GGGTTTGGTA 15060 

GCAGGTTTCG TGGCTGGTGC TATTGCCAAA GAAGGTTTTG CCTTTGGTAA AATTCCTTAT 15120 

GCCGCAGGTG GTGAAGCAAC TTCAACTCTT GCAGGTGTCT CATCTGGTTT CCTAGGTGCC 15180 

CTTGTTGGTG GATTTATCGC AGGTGCCTTG GTTCTTGCCA TCAAGAAATA CGTTAAAGTT 15240 

CCTCGTTCAC TCGAAGGTGC TAAATCAATC CTTCTATTGC CACTTCTTGG AACAATCTTG 15300 

ACAGGATTTG TTATGCTAGC TGTGAATATC CCAATGGCTG CAATCAACAC TGCTATGAAT 15360 
GACTTCCTAG GCGGTCTTGG AGGAGGTTCA GCTGTCCTTC TTGGTATCGT CCTTGGTGGA 15420 

ATGATGGCTG TTGACATGGG TGGACCAGTT AATAAAGCAG CTTATGTCTT TGGTACAGGT 15480 

ACGCTTGCAG CAACTGTTTC TTCAGGTGGT TCTGTAGCCA TGGCAGCAGT TATGGCTGGA 15540 

GGAATGGTGC CACCACTTGC AATCTTTGTC GCAACTCTTC TTTTCAAAGA TAAATTTACT 15600 

AAGGAAGAAC GTAACTCTGG TTTGACAAAC ATCATCATGG GCTTGTCATT TATCACTGAG 15660 

GGAGCGATTC CATTTGGTGC CGCTGACCCA GCTCGTGCGA TTCCAAGCTT CATCCTTGGT 15720 

TCAGCAGTAG CAGGTGGACT CGTTGGTCTT ACTGGTATCA AACTCATGGC GCCACACGGA 15780 

GGAATCTTCG TTATCGCCCT TACTTCAAAT GCTCTCCTTT ACCTCGTTTC TGTCTTGGTA 15840 

GGAGCAATCG TAAGTGGTGT GGTTTATGGT TACCTACGCA AACCACAAGC ATAAAAAATA 15900 

GAAAAATGAA AAGATTGGAC CGTTTGGTGC AGTCTTTTTC TCTTCCCGAA ATGCCTGTGA 15960 

AATATGGTAT AATAGAAGAA TGGCAAACAA GAATACAAGT ACAACAAGAC GGAGACCGTC 16020 

TAAAGCAGAA CTGGAAAGAA AAGAAGCGAT TCAACGAATG TTGATTTCGT TAGGAATTGC 16080 

GATTTTATTG ATTTTCGCAG CCTTCAAATT AGGGGCTGCA GGTATAACCC TTTATAATTT 16140 

AATTCGCTTG CTAGTGGGTA GCCTAGCTTA TCTGGCGATA TTCGGCCTAT TAATCTATCT 16200 

CTTCTTTTTC AAGTGGATAC GAAAACAGGA AGGACTCTTA TCTGGCTTTT TCACCATATT 16260 

TGCTGGCTTA CTCTTGATTT TTGAGGCCTA CTTGGTTTGG AAATATGGTT TGGACAAGTC 16320 

CGTTCTAAAA GGGACCATGG CTCAGGTTGT GACAGATCTG ACTGGTTTTC GAACGACTAG 16380 

CTTTGCTGGA GGGGGCTTGA TCGGGGTCGC TCTTTATATT CCAACAGCCT TTCTCTTTTC 16440 

AAATATCGGA ACTTACTTTA TTGGTTCTAT CTTGATTTTA GTGGGTTCTC TCCTAGTCAG 16500 

CCCTTGGTCT GTTTACGATA TTGCTGAATT TTTCAGTAGA GGCTTTGCCA AATGGTGGGA 16560 

AGGGCACGAG CGTCGAAAAG AGGAACGCTT TG7CAAACAA GAAGAAAAAG CTCGCCAAAA 16620 

GGCTGAGAAA GAGGCTAGAT TAGAACAAGA AGAGACTGAA AAAGCCTTAC TCGATTTGCC 16680 

TCCTGTTGAT ATGGAAACGG GTGAAATTCT GACAGAGGAA GCTGTTCAAA ATCTTCCACC 16740 

TATTCCAGAA GAAAAGTGGG TGGAACCAGA AATCATCCTG CCTCAAGCTG AACTTAAATT 16800 

CCCTGAACAG GAAGATGACT CAGATGACGA AGATGTTCAG GTCGATTTTT CAGCCAAAGA 16860 

AGCCCTTGAA TACAAACTTC CAAGCTTACA ACTCTTTGCA CCAGATAAAC CAAAAGATCA 16920 

GTCTAAAGAG AAGAAAATTG TCAGAGAAAA TATCAAAATC TTAGAAGCAA CCTTTGCTAG 16980 

CTTTGGTATT AAGGTAACAG TTGAACGGGC CGAAATTGGG CCATCAGTGA CCAAGTATGA 17040 

AGTCAAGCCG GCTGTTGGTG TAAGGGTCAA CCGCATTTCC AATCTATCAG ATGACCTCGC 17100 

TCTAGCCTTG GCTGCCAAAG ATGTCCGGAT TGAAGCACCA ATCCCTGGGA AATCCCTAAT 17160 
CGGAATTGAA GTGCCCAACT CCGATATTGC CACTGTATCT TTCCGAGAAC TATGGGAACA 17220 

ATCGCAAACG AAAGCAGAAA ATTTCTTGGA AATTCCTTTA GGGAAGGCTG TTAATGGAAC 17280 

CGCAAGAGCT TTTGACCTTT CTAAAATGCC CCACTTGCTA GTTGCAGGTT CAACGGGTTC 17340 

AGGGAAGTCA GTAGCAGTTA ACGGCATTAT TGCTAGCATT CTCATGAAGG CGAGACCAGA 17400 

TCAAGTTAAA TTTATGATGG TCGATCCCAA GATGGTTGAG TTATCTGTTT ACAATGATAT 17460 

TCCCCACCTC TTGATTCCAG TCGTGACCAA TCCACGCAAA GCCAGCAAGG CTCTGCAAAA 17520 

GGTTGTGGAT GAAATGGAAA ACCGTTATGA ACTCTTTGCC AAGGTGGGAG TTCGGAATAT 17580 

TGCAGGTTTT AATGCCAAGG TAGAAGAGTT CAATTCCCAG TCTGAGTACA AGCAAATTCC 17640 

GCTACCATTC ATTGTCGTGA TTGTGGATGA GTTGGCTGAC CTCATGATGG TGGCCAGCAA 17700 

GGAAGTGGAA GATGCTATCA TCCGTCTTGG GCAGAAGCCG CGTGCTGCAG GTATCCACAT 17760 

GATTCTTGCA ACTCAGCGTC CATCTGTTGA TGTCATCTCT GGTTTGATTA AGGCCAATGT 17820 

TCCATCTCGT GTAGCATTTG CGGTTTCATC AGGAACAGAC TCCCGTACGA TTTTGGATCA 17880 

AAATGGAGCA GAAAAACTTC TTGGTCGAGG AGACATGCTC TTTAAACCGA TTGATGAAAA 17940 

TCATCCAGTT CGTCTCCAAG GCTCCTTTAT CTCGGATGAC GATGTTGAGC GCATTGTGAA 18000 

CTTCATCAAG ACTCAGGCAG ATGCAGACTA CGATGAGAGT TTTGATCCAG GTGAGGTTTC 18060 

TGAAAATGAA GGAGAATTTT CGGATGGAGA TGCTGGTGGT GATCCGCTTT TTGAAGAAGC 18120 

TAAGTCTTTG GTTATCGAAA CACAGAAAGC CAGTGCGTCT ATGATTCAGC GTCGTTTATC 18180 

AGTTGGATTT AACCGTGCGA CCCGTCTCAT GGAAGAACTG GAGATAGCAG GTGTCATCGG 18240 

TCCAGCTGAA GGTACCAAAC CTCGAAAAGT GTTACAACAA TAAAAAAATA GCTTCTTTCC 18300 

AAGTTTGGAG GGAAGCTATT TTAGTGGCTA TTGATTGCTT TTATTTTCTG AAGTTGGCGC 18360 

ATTGGACTGT TTTTCGTTTT CAGTAGCAGG TTTACTTGAA GCAGGAGTAG AAGAGTCCTG 18420 

AGTTGCTGTT TTCTGATCTT CTTTTTTCTC TTCCTTGACG CTAGATTTTG GTGTTTCCTC 18480 

TTGCTGTGTT TTTTCTTGAC TAGTGTTAGT CTCTTTAGTT GGACTGGTGT TTTCCTTAGG 18540 

GGATTCCTTT TGGATTTCTT TGACAATGGT TGTCGTCTGG CTTGTCGTAG GTTCTTTTTT 18600 

AATATTTTTG TTATTATCCA AGGCGTT 18627 
(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2560 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

TAAAATACGT TACCTTGCTT CTGCACGTTC AGCAGGTAAG TCATTGAAAT TTAAAGATCA 60 

AGATATTACA ATTGAAGAAA CGACTGAAAC AGCTTTTGAA GGAGTTGATA TTGCTCTCTT 120 

TTCAGCAGGT AGTTCTACAT CAGCTAAGTA TGCACCATAC GCAGTAAAAG CTGGCGTGGT 180 

AGTAGTAGAT AATACATCTT ATTTCCGTCA AAATCCAGAT GTTCCTTTGG TTGTTCCAGA 240 

GGTCAATGCT CATGCACTTG ATGCTCACAA CGGAATCATT GCCTGCCCTA ATTGTTCAAC 300 

AATTCAAATG ATGGTGGCTC TTGAGCCGGT TCGCCAAAAA TGGGGCTTGG ACCGTATCAT 360 

TGTTTCAACT TATCAAGCCG TTTCAGGTGC TGGTATGGGA GCAATTCTTG AGACACAACG 420 

TGAACTTCGT GAAGTCTTGA ATGATGGTGT GAAACCACGT GATTTGCATG CGGAAATCTT 480 

GCCTTCAGGT GGTGACAAGA AACATTATCC TATCGCCTTT AACGCTCTTC CACAAATTGA 540 

TGTTTTCACT GATAATGATT ACACGTACGA AGAGATGAAG ATGACCAAGG AAACTAAGAA 600 

AATTATGGAA GATGATAGCA TTGCAGTATC TGCAACATGT GTGCGTATTC CAGTCTTGTC 660 

AGCTCACTCT GAGTCTGTTT ATATCGAAAC AAAAGAAGTG GCTCCAATCG AAGAAGTAAA 720 

AGCAGCTATC GCAGCCTTCC CAGGTGCTGT TCTTGAAGAT GATGTAGCTC ATCAAATCTA 780 

TCCTCAAGCT ATCAATGCAG TTGGTTCGCG TGATACCTTT GTTGGTCGTA TCCGTAAAGA 840 

CTTGGATGCA GAAAAAGGAA TTCACATGTG GGTTGTTTCA GATAACCTTC TCAAAGGTGC 900 

TGCTTGGAAC TCAGTTCAGA TTGCTGAAAC TCTTCATGAA CGTGGATTGG TTCGTCCAAC 960 

AGCCGAATTG AAATTTGAAT TAAAATAGTC ATATCGTTTA GGAGTTCAGA TGAACTCCTT 1020 

CTTTGAAATA GAGAGGTGTT TTCGTGTCTT ATCAAGATTT AAAAAAATGT AAAATCATTA 1080 

CAGCCTTTAT TACCCCCTTC CATGAGGATG GTTCCATTAA CTTTGATGCT ATTCCAGCCT 1140 

TGATTGAGCA TTTATTGGCC CATCATACGG ATGGAATTCT TCTCGCAGGA ACGACTGCTG 1200 

AGAGTCCAAC TTTGACCCAC GATGAGGAGT TGGAGTTGTT TGCGGCTGTA CAAAAGGTTG 1260 

TCAATGGACG CGTTCCTTTG ATTGCGGGTG TAGGTACTAA TGATACGCGT GACTCTATTG 1320 

AGTTTGTCAA AGAAGTAGCG GAATTTGGTG GTTTCGCAGC TGGGCTTGCT ATTGTTCCTT 1380 

ACTACAACAA ACCTTCTCAA GAAGGGATGT ATCAGCACTT TAAGACTATT GCAGATGCTT 1440 

CTGACCTACC AATTATTATC TATAACATTC CAGGGCGTGT AGTTGTCGAA TTGACTCCAG 1500 

AAACCATGCT TCGCTTGGCT GACCATCCAA ATATTATCGG TGTCAAAGAA TGTACTAGCT 1560 

TGGCTAATAT GGCTTACTTG ATTGAGCACA AGCCTGAAGA GTTCTTGATT TATACAGGTG 1620 

AGGATGGAGA TGCTTTCCAT GCCATGAACC TTGGGGCGGA TGGGGTTATT TCTGTTGCCT 1680 
CTCATACAAA TGGGGATGAA ATGCACGAGA TGTTTACTGC GATTGCAGAA AGCGATATGA 1740 

AGAAAGCCGC AGCAATTCAG CGTAAATTCA TTCCTAAGGT TAATGCTCTC TTCTCTTATC 1800 

CAAGTCCTGC TCCAGTTAAG GCAATTCTTA ACTATATGGG ATTTGAAGCT GGACCCACTC 1860 

GTCTACCTCT TGTTCCAGCA CCAGAAGAAG ATGCCAAACG CATTATCAAG GTTGTCGTAG 1920 

ATGGCGACTA CGAAGCAACT AAGGCAACTG TAACAGGGGT CTTAAGACCA GATTACTAAT 1980 

AAAGACAATA AAATCCGGCT CTTTGTCAAC TGTAGTGGGT TGAAGTCAGC TAAGCTCGAG 2040 

AAAGGACAAA TTTTGTCCTT TCTTTTTTGA TATTCAGAGC GATAAAAATC CGTTTTTTGA 2100 

AGTTTTCAAA GTTCCGAAAA CCAAAGGCAT TGCGCTTGAT AAGTTTGATG AGATTATTGG 2160 

TCGCTTCCAA TTTGGCGTTT GAATAGGGTA GTTGAAGGGT GTTGACGATT TTCTTTTTGT 2220 

CCTTTAGAAA GGTTTTAAAG ACAGTCTGAA AAATAGGATG AACCTGCTTC AGATTGTCCT 2280 

CAATGAGTCC GAAAAATTTC TCCGGTTCCT TATTCTGAAA GTGAAACAGC AAGAGTTGAT 2340 

AGAGCTGATA GTGATGTTTC AAGTTTTGTG AATACCTCAA AAGCTTGTTT AAAATCTCTT 2400 

TATTGGTTAA GTGCATACGA AAAGTAGGAC GATAAAATCG CTTATCACTC AGTTTACGGC 2460 

TATCCTGTTG AATGAGTTTC CAGTAGCGCT TGATAGCCTT GTATTCGGGA TTTTCGATGA 2520 

AACTGATTCA TGATTTGGAC ACGCACACGA CTCATAGCAC 2560 
(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11303 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 

TATTGGATTT CCCTTGCAAT CAGTTTATGG GACAAGCACC CGGCAGCGCA GAGGAAATCA 60 

ACGCCTTCTG TAGCCTACAT TTTCAAACCA CCTTCCCACG TTTTGCCAAG ATTAAGGTCA 120 

ACGGTAAGGA AGCAGACCCT CTCTATGTCT GGTTACAAGA CCAGAAATCC GGCCCACTAG 180 

GAAAACGAGT CGAATGGAAT TTCGCTAAGT TTCTCATCGG TCGAGATGGG CAAGTCTTTG 240 

AACGCTTTTC TTCAAAAACA GACCCAAAAC AAATTGAAGA GGCGATACAA ACTCTACTAT 300 

AATTCACAAT CTCACTATGA TTAGGTTTCC TTTAACCTGA TGAATAGTGA GATTTTTTGA 360 

TGGGCTTTGA CTTAAATAGA AAAACACCCC ATGATATGAA ACATGAAGTG TTGTAAAGTC 420 

TATGTTGTAG GTGCTTATTT CACAATTTCA ATGTGACCAG TGATAACGAA TACCATACAG 480 
AATCTTCATA TACACTAAAC AAATGACTTT CTAATTATTT CAATTAGTTT TGGCTAGTAA 540 

ATATCATTTC CAACAAACGC CCTCTCAATT CCTTATCCTG ATGATGCAAG ATATTCATTA 600 

AGTCATGAGA GTTTTTCGCA TTGATGAATT GATTTAACAA TCTATCTTTT AATTCATATG 660 

GAAGAGAAGC TGTCTTTAGT AGTCTAAAAA CTTCGTCATT TAAAGATGTC CTTTTATTAT 720 

CTTTCCATTC AAATTTAGCT GTATCATTCT TATTTGGCAA TTCAATTATA GACACATTCG 780 

TTCCTTTAAA ATGAATTCTA TGTTTTCTAT TGCTTGGAAC GATACTAGAA TCTCCTTGTA 840 

ATGCTAACTC TACCATTCCC ATTTCCCAAT CGATTGATAA TCTTGTTTTA TATCTTTGAC 900 

CATTTTGATC TTCAAGCATT TCAAAAGAAT GTTGTTTTCC TGGGAATACA TACCAATCTA 960 

CAACTTCAGG TAAATCAACA CCCATACCTA TCTCAGAACC AACCAAGGGA ATGATTGCAC 1020 

CACTTTTTGC AAACACAGGC GTAGTCGAGA TGTCCCTATA AACACTTAAC TTCACACCAC 1080 

CTGTGTATTT TTTCTCTGAA AAGAAGTCAT ACCATTCACC TTCAGGGAAC CATACATCTA 1140 

CTTTTGCAGA TTGGAATGTC AAATCCATCT TTTCTACAAT GGGAGCCACC ATCAGTTCTG 1200 

TTCCAAAAAA GTATTGGTTT GGAACATTAT AGCTCTCATC ATTCTCTGGA TAGAAATAAT 1260 

AGATTGGACT GATTAATGGG GCACCTTCCT CATGTGTCTG TACATTCATG GTATATAGAT 1320 

AGGGAATCAT CTGATGTCTC AAACGAAGGT ATTTCTTCAT AATCTTAGAT GTTGTTTCTG 1380 

AAAAAAACCA AGGTTCTTTA CTATTAAAAG GACTTCTAGA ACTATGTAAT CGAGTAATCG 1440 

GACTAAAAAC ACCAAACTGT AGCCATCTAG TTTGTAGCTC TTCGTCATAA TCCCCCAACA 1500 

TATGTCCACC GATATCATGA CTCCACCAAC TATAACCGAT ATTAGATGCT GTCGCTGTAA 1560 

AATAGGGTTG AAATCTTAAG GAATTCCAAC TAATAATAGT ATCCCCTGAA AAACCAACAG 1620 

GGTAGCGGTG ACTACCAGGA CCTGCATATC TTGATAAAAT CAAACCACCT TCTGCATTTT 1680 

TACAACTATC CTGATAGTGA TAATGGTTTA AAAGCCAAAG TGGATCTAGC ATACCTTGTG 1740 

TCCCTTGTTG CCAGTCAATC CACCAAAAAT CTACTCCCTG CTTTTCTAGT TCATAATGAA 1800 

CATCTTTAAA GTAGGCTTCC CTAAAAGAGG GATTAAAAAA ATCAAAAATA GCAGGTTCTT 1860 

CTAGTTCTAC ATTTAACCCC AACCGTTTTG CGATTTGAGG ATAAGCTTCT TCATAAGCCC 1920 

GTATCCCATC AGCAGGATGG ACATTTAAGG AGAGTTTTAG CTTTCTATCA TGAAGTTGTT 1980 

GCAATAACTG TTCTGGATTT GGTATTAAGT TTCTATTCCA ACTATATCCT GTCCAGCCAC 2040 

TTCCAAAGCG AGCTGGAATG TCAGTTATAT GCCAATCCAT ATCTAACACA CCGATAGATA 2100 

ATGGAATTTT CTCTGTTTCA AATCTGTCTA TTAAATCCAA GTATTCATCC GACGTATAAG 2160 

GCCAATATCT ACTCCACCAA TTGCCTAAAG CATATCTTGG CAACAAGGGT GTTGAACCAG 2220 

TCAAATGGTA AAAATCTCTG ATTGCTCCTC TATAATCATG CCCATAGGCA AAGAAATACA 2280 
GGTCAATTTG ATTTTCTCTC TCAATATAAC CAGATTGTTC ATCCCAAATA AATCCTTGAG 2340 

AATCATCCAA TAAGGCTATA CCATTTCGGC TAATAATTCC ATCTTCTAAC GAGATTGCTC 2400 

CATCTGCCTT ATCCAGAGTC CGAGCTGTTC CTTTTAACGT TTCAATAGAT TCACCAAAAT 2460 

ACCAGCGACT ACCATATACG GCAAAATTTC CTTTTAATTC TATAAATAAA TTTTCGGCGT 2520 

TAAATTCTCC TTTATTAAAG TGCAGATGAA AATAGTCCGT CATAATATCT AGTACGTTTG 2580 

ATGTCTCGAT ATAATCTAAC GAAATTTGGC CAAAATCTCT ATTATAGATA AGTTGTGTCG 2640 

TTCTATCCTC AAAACTTCCA GTTTGAGAGT ATTCTAACCT TACTAGCTTG TCTGTTAATA 2700 

CAGAGATTCG ATAAAACTCT CCCTTAAAAA TTTTCAATTT GTTTTCCTCC TTTTATGGTA 2760 

GCATAAAAAC AGAACGCACC ATTTTTGATG CGTTTTTCAT TATTCTGAAT GCAATGTTCT 2820 

ATCTGTTATA TCTATGACAA ATAATAGTCA ATTGAAAAAA TGCAGTGGAC AAAATATCTT 2880 

TTAACAAACC AAGAGTTTAT TAAAGAGTTA TCACTTTTCA ACTTTTCTAA GCTTATGCAG 2940 

TTGTGAAACA AACTACTTTT AAACTATTAA CTAAGATAGG ATTGATAAAT AATTTCAAAC 3000 

TCTTACTAGC AATCATACGA TATTCAAGCT CACGTGCTTT TTTCCTTCCT GCTTATTTCT 3060 

TAGAACTGAA GAACCCGGAT CGGTATATAA ATTATCCGGA TCAACATAGT CATAAGATTC 3120 

ATAACAGTTG CGCTTCATTA AGTCATCCCC AGAGCAAGAG CTTCATCTCG TAATTTTTCA 3180 

ACATCACTAA CCGTAGGTCG CCATCCTTCA ATCATATTTG TACTTAAAGC ATACCAAACA 3240 

CTCTTAAAAA CGGATCGGTT TTCAAAAGCT ATTCCCATGA TTGTCATCTT TTCTTTATCT 3300 

ATATCTAAGG ACATATGCTA CCTCCTTTAG ATACATTATA CCATGTTTCT CTGTAGCTTT 3360

TAAAAATTTT ATTTTGTTTG TCATATCTAA GTTTTCAGCA CGCTTATCCT ATTTTATAAG 3420

CCTCAAACCC AAATATAAAA CGCATTCTTT TTGCTTTTTT ACTATTGTAT CGTATTCTAC 3480

GATAACATAC TTTACTTTAT TGTTTTTTTA AATAACAGCA GTTCCCTGTT TATCAACTAT 3540

TCGAACTACT TTCTATTTTG CTTCATACCC TACATAGCGA AAAAATATGA AAAAGCAGAG 3600 

AAGAATATCT TAAAAAGACC TCTTCACTGC TAATATTAAC ACTCATTATT TAAACTATAT 3660

GGATTCTATC ATCGAGTATA CTTTTTTACT TATTAGATAC CTTGCTCTTC TTTCACCAAT 3720

TTTTGATCAT ATACACGGAT GAATGGAAGA TAGACTAGGA ATGCTGCAAA TGCACATACT 3780

AGAGCAACTA ATACAGCTCG AAGATCTGCT GTCCCTAAGA AAGCTCCAAT CCCTACTGGA 3840 

GTTGGCCATG GAACCTGTGC GATAATTGGC TTAATAAAGT TTAGAGAATT CGCTACGTAA 3900 

TAAATAGTAG CAGTAACCAT TGGTGCTAAA ATAAATGGTA TAGCCAAGGC TGGATTATAG 3960

ATAATAGGTA ATCCAAAAAT TAATGGTTCA TTAATATTAA ATAAGGCTGG AACTACAGAT 4020
GCTCGTCCTA TTGCTTTAAG CTGTTCAGAT TTAGAGGCAA AAGCAATATA TAAACATAGT 4080

CCTAAAGTTG CACCAGAACC ACCTGCAATT ACAAACATAT TAGAAAATTC ACCTGCAACA 4140 

GCGAAGTGCC CGCCAGCAGC ATTTTCAGCC ATGTTAGCAA GAGCAATTGG ACTAACAAAT 4200 

GCAAAAACAA TGTTCGCACC GTGGATACCT ACAATCCAAA GTAGTTGAGT CAATAGATAA 4260 

ATAATCATTA AACCAATCCA CGAATTAGTC AGATTGGATA CAAAACCAAA TGGAATTGCA 4320 

ATGACTTTAA AAATATCTGT TCCCATTGCT ACAAGAAGAC CGTTGATAAA GATAACAACA 4380

AATGCAACAA CAAATCCCGG AACCAAAGCG GTAAATCCAC GAGAAACTCC TTCTGGAACA 4440 

GCTTCAGGCA TTTTAATAAC CCAATTATGT TTAACACACA TACGATAAAT AAGAACAGTC 4500 

ACAATTGCCA TAATGATTGC GGTAAAAATC CCTGTTGTCC CAAAACGTGC GACTACATTT 4560

CCCATTGCCC ATCCATCTGC AATTACTGCA CCTTCTTTTA GACTTGTCAC AGTCTTCATC 4620

ATTCCACCAT CAAAAATGAT TTGCGGTACT GTCATGACAA AAGCCATCAA GGCAAGCAAG 4680

GCACCATTAA GAGGATTCAT ATTGAGTTCT TCTTCCTCTG CATAAATTTT TGTCAATTCA 4740 

TATGCAAGTG ATAGAACGAA ATAAAGAGAT AGAGAACCCA TAGTCGCATA GTTTGCAACC 4800

ATGTAAAGTG ATGTGAATTT ATCAAATGAA GCAGAGAAAA TATCTGCCAC AATTGGCCAA 4860

AATGAGAAAG CTTGTGGCAA AATACTGAAT ACCAAAAACA TTGATCCTAC AATAGTAAAT 4920

GGTACAGCAG CCATACCTGC AGCCGTGATA GCACGTACTA CTTTAAACTG AGCAAGTTTG 4980

CCCATTGGTC CCATAACATG GTTTTCAAGA AAACCAAACA ACCCGTTTTG TTGATCCATA 5040 

AATAGACCTC CTTAATAAAA CATAATAATT TTTACTTTCT AAAGACTAGT TTCAAATACA 5100 

AATTATACTA GATCAGGATT ATAAACTAAG TGAGTTCTTT TCCAATTGGA CAAATTGTTG 5160 

ATAAGCCTTA TCTGTTCGTT TATAAATTTT TTTAATTCTT CTAATGTCTA ACAAACTCAG 5220 

AACTAAACCT AATAGAAGAA CTACAAAAAC AAATAAACGT GCTACTTGGT TATTTTCAAA 5280

AATCGGAAAA AGATTCTTAA ACCAACTTGT CCAAGTTAAA ACAAGTAATC CTATTGAAAT 5340 

AAGCATTTGT ATTCTAACAA ACATTAGTGT TATTCCCAAC TTTTCTTTCC TATTTCCATA 5400 

AAGTTTAAAT TGTTCAACAG TTGCTAAAAT AGAAAATACT ATGAGCATAA TGGGGAAAAT 5460

AATAATAGGC GAGGGACTAA TAAACTGACT CAAAAGCCAA TAAATATTCC CAAAAAAGAA 5520 

GAGTGCTATT GAATAACGTA GAAGAAGATA TCGATTGAAA AAAGTATTAG TTAGAGCCAT 5580

CTCTCGACGT TGTTGTTCAA TCTTTTGTCG TTCTTTTTTA TCCATATCAT TTCCTCCTTA 5640 

TATAACAACA CATATTTAGT TAACTTTCTT ATAAAGAGCT AACATTTCCT TTGCTACTTC 5700 

TAATAATGTC ATAGTGGTCA TTAAATGATC TTGAGCATGT ACCATGATAA TTTCAATTTT 5760

AATTTCCACT CCACTTGCGT ATTCTTGCAA GAGTTTGGTT TGTGCATGAT GCGCTTCAAG 5820 
AATTATCTCA TTTGATTGAT TTAATTTACT TTCTGCATCA TCAAAACTAC CTTCTCTCAT 5880

TTTTGCAAAT GCTTCATGTA TTTCTGACCT TGCATTTCCC GAATGCAGGA TAATTTCAAA 5940 

TGCTGCAACC TGCAGTTCCT CTTGATTCAT ATAAACCTCC TATTTTATCT TCTCAAATAT 6000 

GTTAATAAAA TCTTCAAAGT TATTGCAAGA TATTAGCTGA TTTTGCAATT CATCATTCTC 6060 

TGTCAGAGAG ACTATCTTTT TAGTCACAGT TGCCAAACCT TCGTTCCCAT ATATTGATGG 6120 

AGATAGAAGA AATACTAGCT GGACATGTGA ACTTTGATTA TCCCAGAGTA ACGAATCTTT 6180 

ACAAATTGCA ACCGAAACCT TTCCCTCTGT ACCAAAGGGC TGAATAGGAT GCGGAACTGC 6240 

AATTTTTTCA GAAAAAACAA CTGAACTTAA TTCTTCGCGC TGTTTAATTC CATAAAGTAA 6300 

AGATTGTTCA AACTCATTTG ATTCACCAAC AGATAAACTC TCAACCATCT TTTCAAGTAA 6360 

ATTTACCTTG TCTGATTCAG TACATATTAA AAAGTTTTCT TTACTAAAAT ACTGTCTAAA 6420 

GCCGTTGTTT TCAAATTTGT TAATCTTTGA TGATTGTACA TAACTAGAAA CTTGCATCTA 6480 

ATCCATAGCT TTTCTAATCA TTTCCATCTC ATCACTCTTA AGAAACACAC TAACTTTAAA 6540 

AACTGGGATT TGAAAATATA GATTTGATAA ATCAATAGCT GACACTATAA AATCTATTCC 6600

TTTAAGTTTT TCTTGATTCA ATTCATAGTA GCCTATTACA TCAACAACTT CTACTCGCTT 6660 

CCCAAACTCC GTTTCCAAAC GATTTCTTAA CATTTGGGCT GCACCAAATC CTGTTGCACA 6720 

AATAGCAAGA ATATTAAACT TAGTACTCTC TTTGCTACGT TCCATAGCAG CTAAAAAGTG 6780 

AAGACTTACA TATGCTACTT CATCATCTGA TATTGTCCAC TCCAAGAACT TGTCCATATT 6840 

TGCAAGAATT TCTCTAGTCA TAAAGAATAT ATCACTATAA TTCTGTTTAA TTTCATCTAC 6900 

CAAAGGGTTA TTTAAGGTAA TCCGGCTTTC TAAACGTACT TGTAGTGTCA TTAGATGAGT 6960 

TATCAATCCT TCAATTAGTT GGAAATCTGA AGAAAAGTTA TACATATCAT CTAATCCTAA 7020

ATTCTGAAAT GTTTTAAATA AAGATTTTTT TAAAACTTCT TCAGAAATAT TCTTCTGATT 7080

TTTTTGACAT TGTTGACTCT TAGCTAACAA ATGCAAAGTA ATGTAGTCTA TTTCCTGAAC 7140 

TGGAAATTCC TGATTTGTTA CTTCTCTTAC TTTAGAAAGA ATTCTTTGGG CAACCTTTCT 7200 

CTCTATTGCA TCATCAGTCA TCTGACAGTC TATATTTTTT ATTTCAAATC CGGATTTTAA 7260 

ACGAATCACA GACAATGCTA TGTGAACTAC TAAATTCTGT AGTACAAAAT CAGATAGTTT 7320 

TAGGTTGGCC TCTTGGCATT CATCCAAAAC AATTCTAGCA AATTCTTCTA ATGGAACAGT 7380 

TTGATCAAAA AAGTTAAATT TTACATAGCA ATGTATTGTT TTAAAAAATT GATTCTCTAG 7440 

GAAATAATTT ATGATAAAAC GTCGTTTATC ACGTTCCTCG CCTGAGACAT AAACTCCTTT 7500 

ATTCGCCCTA CTCTCAATGG ACAAATTATA CTCTGATAAC ATCACTCGTA TCTTTCTGAA 7560
ATCATGAGAT AATGTTGAAC GACTAACGTA AAGTTCATCA GCTAAATCAT CAAAAAGAAC 7620

TGGAACTTGC TCAAATAATA ATTTATTTAA GATAAATACT AAACGATCAT CACCTTTTGA 7680

AACCGCAGTT TTCGTATAGT CTTCTTCCAG TTCATAAGTT TGTCTAAACT CCTGGTAAGC 7740 

GCCTTGATTC TCAAAAAATA TTTGATACCC TTGACCTTGT TTTGAAATCA ACCGGACTCC 7800 

TTGAATAATC ATTGTCTTCT CAATTAATTT CAGTACATTA CGGACAGTTC TATCTGAACA 7860 

GGATAAATAT TCTGCCAGTT CTTTGCTTGT AACAAAACGT TCCTTATTTT TTATTAAAAA 7920

TTGAAGGATA TCTTTCTCTT TAATGTTTAA CACATTCATT CCCTCCTAAA ACGTATGTTT 7980

TCATATATTG AAGCATATTA TACACTTAAA TCAGTTTATA TCAAACTCAA AACAATTTAT 8040 

CTTAACCTAA ATATTTATTG ACATTTCATG TGTTCATCAA ATATTCTCAA GAATCAAATT 8100 

AGCCATTTTT TCAATTCCCA TTGGAATAGG AATATAGGCT TGAGGAGGTA TTTGTACAAC 8160 

TGGTTTTCCT GCTTTAGAAC CAGCCTCTTC AAATTGCTTA AAGTACATTT TTGTTTGAGG 8220 

ACTGACAAGA TACAAATCAA AAGCTGCTGC TGCGATAGCT TTCCCTCCTT CAGTAGCACT 8280 

AATAGCATCA ACTACAATAT CTTTCCCTTT TCCTTTTAGA AACTCTGTTG TTTTCTGTGC 8340 

CATAAGTGAT GAAGACATTC CTGCTGCACA AATAATTAAA GCTTTTGCCA TAATATTTTC 8400 

TCCTTTTCTT AAATCCAATC AAAGCTGTGC TAAGTTGGCT TATTTGTTAT CTATTTTTAT 8460 

TATAAAATAA AGCGTTTCCA ATGACAATTC CCTCATTTTC CTAAATGATA TGGAAAAAAA 8520 

TTATTTATAC TTCAATTTAT AAAATAAAAT TATTCCTGAG AGTAGAAATG AAACACTATT 8580

TGCTAAAATC AAAGGCAAGT CTCCTATACG AATACCATGA GCAAGCCACA ATGCAATACC 8640 

AATAACTTGC ATAACATACA TACCTAGAGC AATAGATCCT GTGTCCTTTG TCTTAACTAC 8700 

ACGAAAAACT TGTGGTAAAA ATGCAAATGT TGTTAAAATT GCTGCAATAC TTCCAATCAT 8760

ATGTCACCTC AATATGCTAA ACAAACTGAG AATAATCTCA GTTTGTTTAT ACTATTCTAC 8820 

TGATTCACCG TTAGATGAAA TAACTTCCTT ATACCAGCCA AAAGATTTTT TCGGGGAACG 8880 

ATTATAACTT CCCTTCCCAT TATCATCTTT ATCTACATAA ATAAAGCCAT AACGTTTCCG 8940 

CATTTCACCG GTACCAGCTG AAACCAAATC AATACATCCC CATGGAGTAT AACCCATTAA 9000 

ATCAACACCA TCTTCAACTA CAGCCTTTTT CATTTCACGA ATATGGGCAC CTAGATATTC 9060 

AATTCTATAA TCATCATGTA CCATACCATC TGCTGCAACT TGATCTATAG CTCCAAAACC 9120 

ATTTTCAACA ATAAAGAGTG GTAAGTGATA GTGGTCTGTA AACCAATTTA ACGCATAACG 9180 

CAAACCTTCT GGATCAATTT GCCACTCCCA TTCAGAAGCC TTAACATAAT TATTTTTCAC 9240 

TAAATCTTCT GTTTCAAGAT AATCAAAATA AGGATTATTT TCACGATGAG AGTCGATAGC 9300

AAAGGACATA TAGTAACTGA AACCAATGTA ATCTACAGTC CCACCAAGTA AATCTTCTTT 9360 
ATCCTGGGCA GTAAAATCAA CTGAAATACC TTTTCGTTCC CAATACTTGA AAATATGCTC 9420

AGGATATTTA CCTAAAACAT GCACATCAGC AAAATAATAA CGCTTCTGCA TAGCTTTCAT 9480

TGCCATTAAG ATATCCTTAG GATTGCAAGT AACTGGATAA ATTGGACACA TCGCAATCAT 9540

ACAACCTATT TGAAAATCTG GATTAATCTC ATGACCAATT TTTACAGCTC GTGCAGAAGC 9600

AACTAATTCG TAATGTGCTG CTTGATACAT AATTGCTTCT CTATTATCAC CTTCCTCATA 9660

TACAATACCT GAGTTAGTAA ATGGTGCAAA ATCTTCCTGA TAATTCGCTT GATTATTGAT 9720

TTCATTGAAA GTCATCCAAT ATTTAACCTT ATCTTTGTAA CGTTTAAATA CGACTTCTGC 9780

AAAACGAGCA AAGAAATCAA TCAATTTCCT ATTTTTCCAA CCACCATATT CGGTCACTAA 9840

GTGATAAGGC ATTTCAAAAT GAGATAGAGT GATGACAGGT TCAATACCAT TCTTTAAGCA 9900

TTCATCAAAA AGATTATCAT AAAACTGTAA TCCTTCTTCA TTCGGCTCTA ACTCATCACC 9960

TTTTGGAAAG ATACGTGTCC ATGCAATAGA GGTACGGAAG CACTTGAATC CCATTTCAGC 10020

AAAAAGTGCT ATATCTTCTT TATAACGGTG ATAAAAATCT ATCGCCTCAT GATTTGGATA 10080

ATATTTACCC TCTAAAACTC CCAAAGTAAT TTCACGAGCT ACTCCATGAC GACCAGCAGT 10140

CATAACATCA GCAACACTAA TTCCCTTGCC ACCTTCTTGC CATCCACCTT CAAGTTGATG 10200

AGCAGCAACA GCACCACCCC ATAAAAATCC ATCTTTAAAA GTAGTCATCT TTTTTCCTCC 10260

TGACTTTGAT ACTCTTATTA TAAACCTTAA ACCAAAAGAT GAAAACGCAT TCTTTTTCCT 10320

TATTGTTAAG GAAAGAAGTA ATTTTTAATG GAAATAGAAC AATATCTTCT TGTATTCTCG 10380

TAATGATATC TTTACGATTT TCAATACTTT CAAACTACAA AAACTCTCAC AATAATTCTA 10440

ATTCCCTGTG TCTATAAACG ACTTATCGCT TTCTGGCATC CCAGAATCAT CTTCTATATA 10500

ACGTTCAACT TGCATCTGCA AGTGATATTT TTTTCTTAAA TCTAAGATTT TCTGCATTGT 10560

CTTTGATTGA TAATGTTTAT CTAAAGTTTC TTGATTTATC CACTGATCAA TAAGGAGAAT 10620

AGTTCCCTCT TTTTCAATTG GTAAAAAATA TTCGTATTTC AAGTTACCTT TTTGATTTCT 10680

AATTTCTTTA ACAAGGCCAC TATCAAGCAT TTCTCTTGCA AACTTTATTG CACTATCTCC 10740

ATCACCTTTA TAATATACAT GAATAGTCAA TGTCATCTTA TATCCTCCAA AATCATCCTT 10800

CAATTTTAAA AAAACAAGTT TAGATGAGGA TCTAAACTTG TTTTTTATGA ACTAATTATC 10860

TAACGTTTCG CCATTACTTT CAATCACTTC TTTATACCAA TAAAATGATT TTTTCTTATA 10920

GCGATTTATA GTCAATTGAA ACAAGAGCAG GACAAAAGAG CCTCATAAAA GGTATTGCAA 10980

CTTGGTAATA CCTTTTTGAG GTGCTTTTTG ATATGAGCCC ATGTTTTCTC AATAGGATTG 11040

TACTCAGGTG AGTAGGGAGG AAGAGGTAAA AGTTTATACC CAAACTCTTC ACACAAGAGT 11100
TCTAGCTTCC CCATTCTATG GAATCTTGCA TTATCCATAA TAATAACCGA TGGTGTGGTT 11160

AATGTTGGTA AGAGAAACTT CTGAAACCAA GCTTCAAAAA AGTCGCTCGT CATCGTCTCT 11220

TCGTAAGTCA TTGGAGCGAT TAACTCACCA TTTGTTAGAC CTGCAACCAA AGAAATCCTC 11280 

TGATATCTTC TTCCAGATAC TTT 11303
(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3112 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116: 

CCTTAGATTT CCACTTGCCA GAGGAATTGA TTGCCCAAAC GCCCCTTGAA AAACGTGATG 60 

CCTCCAAACT CCTCATCGTC AACCGTGAGA CAGGAGAAAT GCAAGATAAA CATTTCCACT 120

CTATTATTGA TATGCTGGAA CCTGGTGATG CCCTTGTCAT GAACGACACC CGAGTTCTCC 180 

CTGCCCGCCT CTATGGTCAA AAAGTGGAGA CAGGAGGTCA TGTGGAACTT CTCCTCCTTA 240 

AGAACACTAG TGGAGACGAG TGGGAAGTTC TGGCTAAACC TGCCAAACGC CTCAAGGTCG 300

GTACTCGTAT CAGCTTTGGT GATGGCCGCC TCAGCGCTGT CGTTACAGAA GAATTGACCC 360

ACGGGGGACG CATTGTCCGC TTTGAATACC AAGGAATTTT CCTAGAAGTC TTGGAAAGTC 420 

TGGGAGAAAT GCCTCTGCCA CCTTATATCC ACGAAAAATT AGATGACCGT GAACGTTATC 480 

AAACCGTCTA CGCCAAGGAA AGTGGCTCTG CTGCAGCACC GACTGCTGGT CTTCACTTCA 540

CCAAAGAACT GCTGGCAGAA ATCCAAGCTA AGGGTGTTCA TCTAGTCTAT CTGACTCTCC 600

ATGTCGGACT CGGAACCTTT AGACCTGTTT CTGTGGATAA TCTGGACGAA CACGAAATGC 660 

ACTCAGAGTT CTATCAACTT TCTGAGGAAG CTGCTGCCAC CCTTCGCTCT GTCAAAAAAA 720

ATGGTGGTCG TGTCATCGCT GTCGGAACCA CTTCTATCCG CACCTTGGAA ACTATTGGTT 780 

CCAAGTTTGA TGGGCAAATC CAAGCAGATT CTGGTTGGAC CAATATCTTT ATCAAACCTG 840 

GGTATGAGTG GAAGGTCGTG GATGCCTTCT CAACCAACTT CCACCTGCCA AAATCAACTC 900 

TGGTCATGTT GGTTTCTGCC TTTGCAGGCC GTGAATTAGT CTTAGATGCC TACCACCATT 960

CCATCCAAGA ACACTACCGC TTCTTCAGTT TTGGTGACGC CATGTTTATT TATTGAGAAA 1020 

GAATTTCTCT AAATCTTCTA ATACCAATAA ATCGCTAAGA TATTATTTCA AAGAACATCT 1080

ACAATTGAAA CTCTAGCTAG CTGTAGAAGA GGCCTAGTAC ATTGAAATTA AAATGCTTCC 1140 

CCCTAGCTTC GAAAATATTG CCATAGATTC CGTTGACTCT CCAAATTGAT TCATCTATAT 1200
TTTATTTCAG CTTCCTATAC TTTCTTCGCT GTTTGTAAAT CAAAATGCAA GACACATGAG 1260

TAGCACCATA TTTGTTACTC TTATCTGTCC TCTCAAGAGA CTATTATGAG TTATTTCAGA 1320 

ATCATTCACT ACTTTGACCC TGACTCTCCT TAGTCTCAAA ATCAAAGACT TATACTCTTC 1380

AAAAATCTCT TCAAACCGCG TCAACGTCAC CTTGGATTAT ATATGTGatC TGaCTTCGTC 1440 

AGTTCTATCT ACAACCTCAA AGCAGTACTT TGAGCAACCT GCGACTAGTT TTCTAGTTTG 1500 

CTCTTTGATT TTCATTGAGT ATTAAACAAA AAGTGAACAA ATCTGAATTC TAATGTACAG 1560

AAGACTAGGC TTGTTCACTT TTTTATAGTC GCTATAAGAT GACCTTATCT ATAGCTTTTT 1620 

ATATATAATT ATATATTCAG ACATACTATT ATCAATTTTG TCGCAGGGAG GAATCTGTTA 1680

ACGCACCCAT TCACCATTAT CATTGACTCT ATAGCCATCT ATACTTGTAT TGACCGCTAA 1740 

CTCACCCGAT GTATTTACAT AATACCATTT ACCACCAACT TGGAACCATT GATTGACTTT 1800

CATAGAACCG TTGCTGTTGA GGTAGTACCA TGAACTATTA ACTTGTACCC AACCTGTTGC 1860

CATGGAACCA TCAGTATTAT AAAAATACCA CATACCATTT TCTTGTTTCC AGTCTGTTGT 1920

TGGAGCAACT GCTTTAGCTG GTTCTACTGC TACATCTGTT CCTTGGTTAG ATGTAACAGA 1980 

TACAGGATAC GAAGGAATAG ATGATTGCTC AGGAACAACA ACTTTTTCAG GTTCTCTCGT 2040

CCCTCTCCTT ATACGTCTTT TTACCATCTC TTTAGTAATT TGACGAGAAG TAGTTTCTTC 2100 

AATTGTTCCA TCACGTTCAT CTACAGTATA GATTGTAGTA AGAGTAATTT ACCAATTTCT 2160 

CCTACTTCTT CTACTTCTTG ACTTTTATCA AGAGTTGGGC CATCGAGATA TTCTGTTTCG 2220 

ATTGGAATTT CTTGGACAAG AACTTGGGGC TTGGTTCTTT TTTTAACAAC TCTTGTTTGA 2280

GAGTCTTTTT TTTGACTTAA AGTACTCTCA GTTACTTGTC CACTCTTTCC ATCTACATTA 2340

TAAGTTATCG TTGTAACTGT TTTCCCATTC TTTCCTAGAG TAATCTCTTG CTCCTGTCCT 2400

GCAGAAAGGT CATTGTCTGC TTCATATTTA GTAGCAAATG GAACAAGAAC TTCTTCAACC 2460

TTGCTTTTAG CTGGAACTTT GATAACTGTA TCCGTGGCTT CTTTTCTATC AACAGTAACC 2520 

TGTTCGGTAA CATAACCAGT CTCTGGATTA ACATCGTAGG TCCTTGTCGT AGTTACATAG 2580

CCATCCTCTC CATCAATTGT AACAGGATTT TCACTACGGT CTTTTGTTTC ATCTTTTTCA 2640 

TAACGAATTC GCGTACTTGA AATTTTCTTG GTTACTACCT TAGGTTTAGT CGCTACTTTT 2700 

ACAATAATAT CCCCATTGTC AGCGTCATCA TACTCTATTC CCTCTTCTTT ATCTCTAGTA 2760 

TCATCTCTGA CATATTGAAT CCCATCAGCA GCATGAACAA AACTTGTATT CAGATTCCTC 2820 

CTAAAAATAA AGTTAGCCCG ATTACCGCAG AACCAAAAAT CTTTCCGAGT TTACGTATTG 2880 

CATAGCGCTT ATTAGTATTA GATTTTGCCA TTACATCCTA CTTCTAGTAT AGCATCTTTT 2940
CTATCAAACG TTAAACAATA TACGTTATAT ATAAAATAGA CTTAGAATGA TATATTGATT 3000

ATTGAACTAA CACTTTAACT ATATCGTAAT CAATCTCATA TATAAAGGAT TGCAGACATC 3060 

TTATCTAAAT ACATGCGAAT ATATTTAGAT ACAAACATTC CAACTTGATA AT 3112 
(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4327 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 

CCCAAAAATC TCTTCAAACC ACGTCAGCTT CGCCTTGCCG TAGTATGGTT ACTGACTTCG 60 

TCAGTTCTAT CCACAACCTC AAAACAGTGT TTTGAGCATC ATGCgGCTAG CTTCTTAGTT 120

TGCTCTTTGA TTTTCATTGA GTATAAAAAC AGATGAGTTT CTGTTTTCTT TTTATGGACT 180 

ATAAATGTTC AGCTGAAACT ACTTTCAAGG ACATTATTAT ATAAAAGAAT TTTTTGAAAC 240 

TAAAATCTAC TATATTACAC TATATTGAAA GCGTTTTAAA AATGAGGTAT AATAAATTTA 300

CTAACGCTTA TAAAAAGTGA TAGAATCTAT TTTTATGTAT ATTTAAAGAT AGATTGCTGT 360

AAAAATAGTA GTAGCTATGC GAAATAACAG ATAGAGAGAA GGGATTGAAG CTTAGAAAAG 420 

GGGAATAATA TGATATTTAA GGCATTCAAG ACAAAAAAGC AGAGAAAAAG ACAAGTTGAA 480 

CTACTTTTGA CAGTTTTTTT CGACAGTTTT CTGATTGATT TATTTCTTCA CTTATTTGGG 540 

ATTGTCCCCT TTAAGCTGGA TAAGATTCTG ATTGTGAGCT TGATTATATT TCCCATTATT 600

TCTACAAGTA TTTATGCTTA TGAAAAGCTA TTTGAAAAAG TGTTCGATAA GGATTGAGCA 660 

GGAAGTATGG TGTAAATAGC ATAGGCTGAT GTCCATCATT TGCTTATAAA GAGATATTTT 720

AGTTTAATTG CAGCGGTGTC CTGGTAGATA AACTAGATTG GCAGGAGTCT GATTGGAGAA 780 

AGGAGAGGGG AAAATTGGCA CCAATTTGAG ATAGTTTGTT TAGTTCATTT TTGTCATTTA 840 

AATGAACTGT AGTAAAAGAA AGTTAATAAA AGACAAACTA AGTGCATTTT CTGGAGTAAA 900 

TGTCTTATTT CAGAAATCGG GATATAGATA TAGAGAGGAT CAGTATGAAT CGGAGTGTTC 960 

AAGAACGTAA GTGTCGTTAT AGCATTAGGA AACTATCGGT AGGAGCGGTT TCTATGATTG 1020

TAGGAGCAGT GGTATTTGGA ACGTCTCCTG TTTTAGCTCA AGAAGGGGCA AGTGAGCAAC 1080 

CTCTGGCAAA TGAAACTCAA CTTTCGGGGG AGAGCTCAAC CCTAACTGAT ACAGAAAAGA 1140 

GCCAGCCTTC TTCAGAGACT GAACTTTCTG GCAATAAGCA AGAACAAGAA AGGAAAGATA 1200 

AGCAAGAAGA AAAAATTCCA AGAGATTACT ATGCACGAGA TTTGGAAAAT GTCGAAACAG 1260
TGATAGAAAA AGAAGATGTT GAAACCAATG CTTCAAATGG TCAGAGAGTT GATTTATCAA 1320

GTGAACTAGA TAAACTAAAG AAACTTGAAA ACGCAACAGT TCACATGGAG TTTAAGCCAG 1380

ATGCCAAGGC CCCAGCATTC TATAATCTCT TTTCTGTGTC AAGTGCTACT AAAAAAGATG 1440 

AGTACTTCAC TATGGCAGTT TACAATAATA CTGCTACTCT AGAGGGGCGT GGTTCGGATG 1500 

GGAAACAGTT TTACAATAAT TACAACGATG CACCCTTAAA AGTTAAACCA GGTCAGTGGA 1560 

ATTCTGTGAC TTTCACAGTT GAAAAACCGA CAGCAGAACT ACCTAAAGGC CGAGTGCGCC 1620 

TCTACGTAAA CGGGGTATTA TCTCGAACAA GTCTGAGATC TGGCAATTTC ATTAAAGATA 1680 

TGCCAGATGT AACGCATGTG CAAATCGGAG CAACCAAGCG TGCCAACAAT ACGGTTTGGG 1740 

GGTCAAATCT ACAGATTCGG AATCTCACTG TGTATAATCG TGCTTTAACA CCAGAAGAGG 1800 

TACAAAAACG TAGTCAACTT TTTAAACGCT CAGATTTAGA AAAAAAACTA CCTGAAGGAG 1860

CGGCTTTAAC AGAGAAAACG GACATATTCG AAAGCGGGCG TAACGGTAAC CCAAATAAAG 1920

ATGGAATCAA GAGTTATCGT ATTCCAGCAC TTCTCAAGAC AGATAAAGGA ACTTTGATCG 1980 

CAGGTGCAGA TGAACGCCGT CTCCATTCGA GTGACTGGGG TGATATCGGT ATGGTCATCA 2040 

GACGTAGTGA AGATAATGGT AAAACTTGGG GTGACCGAGT AACCATTACC AACTTACGTG 2100 

ACAATCCAAA AGCTTCTGAC CCATCGATCG GTTCACCAGT GAATATCGAT ATGGTGTTGG 2160 

TTCAAGATCC TGAAACCAAA CGAATCTTTT CTATCTATGA CATGTTCCCA GAAGGGAAGG 2220

GAATCTTTGG AATGTCTTCA CAAAAAGAAG AAGCCTACAA AAAAATCGAT GGAAAAACCT 2280 

ATCAAATCCT CTACCGTGAA GGAGAAAAGG GAGCTTATAC CATTCGAGAA AATGGTACTG 2340

TCTATACACC AGATGGTAAG GCGACAGACT ATCGCGTTGT TGTAGATCCT GTTAAACCAG 2400

CCTATAGCGA CAAGGGTGAT CTATACAAGG GTGACCAATT ACTAGGAAAT ATCTACTTCA 2460

CAACAAACAA AACTTCTCCA TTTAGAATTG CCAAGGATAG CTATCTATGG ATGTCCTACA 2520

GTGATGACGA CGGGAAGACA TGGTCAGCTC CTCAAGATAT TACTCCGATG GTCAAAGCCG 2580

ATTGGATGAA ATTCTTGGGT GTAGGTCCTG GAACAGGAAT TGTACTTCGG AATGGGCCTC 2640

ACAAGGGACG GATTTTGATA CCGGTTTATA CGACTAATAA TGTATCTCAC TTAGATGGCT 2700

CGCAATCTTC TCGTGTCATC TATTCAGATG ATCATGGAAA AACTTGGCAT GCTGGAGAAG 2760

CGGTCAACGA TAACCGTCAG GTAGACGGTC AAAAGATCCA CTCTTCTACG ATGAACAATA 2820

GACGTGCGCA AAATACAGAA TCAACGGTGG TACAACTAAA CAATGGAGAT GTTAAACTCT 2880 

TTATGCGTGG TTTGACTGGA GATCTTCAGG TTGCTACAAG TAAAGACGGA GGAGTGACTT 2940 

GGGAGAAGGA TATCAAACGT TATCCACAGG TTAAAGATGT CTATGTTCAA ATGTCTGCTA 3000 
TCCATACGAT GCACGAAGGA AAAGAATACA TCATCCTCAG TAATGCAGGT GGACCGAAAC 3060

GTGAAAATGG GATGGTCCAC TTGGCACGTG TCGAAGAAAA TGGTGAGTTG ACTTGGCTCA 3120 

AACACAATCC AATTCAAAAA GGAGAGTTTG CCTATAATTC GCTCCAAGAA TTAGGAAATG 3180 

GGGAGTATGG CATCTTGTAT GAACATACTG AAAAAGGACA AAATGCCTAT ACCCTATCAT 3240

TTAGAAAATT TAATTGGGAA TTTTTGAGCA AAAATCTGAT TTCTCCTACC GAAGCGAACT 3300 

AGAGAGATGG GCAAAGGAGA GATGGGCAAA GGAGTTATTG GCTTGGAGTT CGACTCAGAA 3360

GTATTGGTCA ACAAGGCTCC AACCCTTCAA TTGGCAAATG GTAAAACAGC GACTTTCCTA 3420

ACCCAGTATG ATAGCAAGAC CTTGTTGTTT GCAGTAGATA AGGAAGATAT CGGACAGGAA 3480 

ATTATTGGTA TAGCTAAAGG AAGCATCGAA AGTATGCATA ATCTTCCTGT AAATCTAGCA 3540

GGTGCCAGAG TTCCTGGCGG AGTAAATGGT AGCAAAGCAG CGGTGCATGA AGTTCCAGAA 3600

TTTACAGGGG GAGTTAATGG TACAGAGCCA GCTGTTCATG AAATCGCAGA GTATAAGGGA 3660

TCTGATTCGC TTGTAACTCT TACTACAAAA AAAGATTATA CTTACAAAGC TCCTCTTGCT 3720 

CAGCAGGCAC TTCCTGAAAC AGGAAACAAG GAGAGTGACC TCCTAGCTTC ACTAGGACTA 3780

ACAGCTTTCT TCCTTGGTCT GTTTACGCTA GGGAAAAAGA GAGAACAATA AGAGAAGAAT 3840

TCTAAACATT TGATTTTGTA AAAATGGCTC TTTGTCAACT GTAGTGGGTT GAAGTCAGCT 3900

AAGCTCGAGA AAGGACAAAT TTTGTCCTTT CTTTTTTGAT ATTCAGAGCG ATAAAAATCC 3960

GTTTTTTGAA GTTTTCAAAG TTCCGAAAAC CAAAGGCATT GCGCTTGATA AGTTTGATGA 4020

GATTATTGGT CGCTTCCAAT TTGGCGTTAG AATAGTGTAG TTGAAGGGCG TTGACGATTT 4080 

TCTCTTTGTC CTTTAGAAAG GTTTTAAAGA CAGTCTGAAA AAGAGGATGA ACCTGCTTTA 4140 

GATTGTCCTC AATGAGTCCG AAAAATTTCT CCGGTTCCTT ATTCTGAAAG TGAAACAGCA 4200 

AGAGTTGATA GAGCTGATAG TGATGTTTCA AGTCTTGTGA ATAGCTCAAA AGCTTGTTTA 4260

AAATCTCTTT ATTGGTTAAA TGCATACGAA AAGTAGGGCG ATAAAAATGT TTATCGCTGA 4320

GTTTACG 4327
(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3521 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 
CTCTGGCCCT GCCACTCCAA CGTTTTGTCA GGGTGCTTTT TTCATAAAGG AGTTCTTATG 60
TTAGATATCA AACGTATTCG TACAGATTTT GAAGCTGTCG CAGAAAAATT AGCTACACGT 120

GGTGTAGATG CTGCTGTCTT GAATGAAATG AAAGAAATCG ATGCTAAACG TCGTAACATC 180 

TTGGTCAAGG TTGAAACTCT CAAAGCAGAA CGTAACACAG TTTCTGCTGA GATTGCCCAA 240

GCTAAGCGCA ACAAGGAAAA TACAGATGAC AAGATTGCTG CCATGCAAAA TCTATCTGCT 300 

GAGGTTAAAG CCTTGGATGC TGAATTGGCA GAAATCGATG CTAAATTGAC AGAATTTACA 360

ACGACTCTTC CAAATATCCC AGCTGACAGC GTTCCTGTTG GGGCTGACGA AGACGACAAT 420

GTGGAAGTTC GCCGTTGGGG TACTCCACGC GAGTTTGACT TCGAACCTAA AGCTCACTGG 480 

GATCTCGGTG AAGACCTTGG TATCCTTGAC TGGGAACGCG GTGGTAAGGT AACAGGCGCT 540

CGCTTCCTCT TCTATAAAGG CCTCGGTGCT CGTTTGGAAC GTGCTATCTA CAACTTTATG 600 

TTGGATGAAC ATGGAAAAGA AGGCTATACT GAAGTCATCA CACCTTACAT AGTCAACCAT 660 

GATTCTATGT TTGGTACTGG TCAGTATCCA AAATTTAAGG AAGATACTTT TGAACTCAGC 720

GATACCAACT TTGTCTTGAT TCCAACTGCT GAAGTTCCTC TGACAAACTA CTACCGTGAT 780 

GAAATCTTAG ACGGCAAAGA TCTTCCAATC TACTTCACTG CCATGAGTCC GTCATTCCGT 840 

TCTGAGGCTG GTTCTGCCGG TCGTGATACG CGTGGCTTGA TCCGTTTGCA CCAATTCCAC 900 

AAGGTTGAAA TGGTCAAATT TGCCAAACCA GAAGAATCTT ACGAAGAATT GGAAAAAATG 960 

ACAGCCAACG CTGAAAACAT TCTTCAAAAA CTCAACCTTC CATACCGTGT CGTTGCTCTC 1020 

TCTACTGGAG ATATGGGCTT CTCAGCTGCG AAGACTTACG ACTTGGAAGT GTGGATTCCA 1080 

GCACAAAACA ATTACCGTGA AATCTCAAGC TGTTCAAACA CAGAAGATTT CCAAGCCCGT 1140 

CGTGCCCAAA TCCGTTACCG TGATGAAGCA GATGGCAAGG TGAAACTCCT TCATACCTTG 1200 

AACGGTTCTG GACTTGCAGT TGGACGTACA GTGGCTGCAA TTCTTGAAAA TTACCAAAAT 1260 

GAAGATGGTT CTGTGACCAT CCCAGAAGCA CTTCGTCCAT ACATGGGTGG AGCTGAAGTC 1320 

ATCAAACCAT AAAAAATAAG GTTTAGCTAT TTCTAGCTAG ACCTTTTTTC GTAACCAAAT 1380 

CAGATAAGCA CCTAGTACAA AGAATAAAAT AGTTAGGCAT ATAATGGTTT CAGCCAATAC 1440 

CAGGTAATCC AGAAATGGAA GTTTCAAAAT TCCCTGAGCC ATCTTGAGCG AGGTCGCTGT 1500 

GATAATGGTT GGGAAGGTGA GGGCTGAGAA GGCTGGTTGA AAACCTTGTT TTAAAATGTT 1560 

GGGCAGACGA GTTAAAACAA AGAAAAAGAA GCATTGAGAA GCCAAAATCA TGACAATCAA 1620

GACCCAAGTC GGCAGGCTGG TTCCTCCTAC TCGAACTAGA GAAGCCAAGA GTAGAGAGAA 1680 

AGGAGCACAG TAGATTCCTT CTTGTCCAAG CAAGGCTAGT GGGAGTGGAT GTTTCTTTAA 1740 

ATCGCTATAA ATAAGGGGAT AGAGATAGAA GGTCAAGAGA AAACCAAAAC TCAAGGTCGC 1800
ATAGGCAATT TCGATAATAC CTACCAGAGG ATAGGTCAAG GCAGCCACTG CTATCCCCAC 1860

ATAGAGAACC GTCCAGCTTG GAGTGGCATG AACCCTCCGC CCTGGACAAG CAAACTTGAT 1920

GGTAAAACCA GCAATCAAGG TCAAATCCAA GAGAAATGAA AACCACCAAA TCCCTTGTGC 1980

TACCAAAGGA AGATAAGAGA ATACGCGAAA GACATAGGTC GATAAAATCA TCCCAGCCAT 2040

AGGAAAGGTT GCCATTCCTG ACAAAAGAGG GGGCTTGGTC AATTCTTGCT TGGTTTCTTT 2100 

CCAATTAAAG AGATGCAGAA TTAGAAAGTA AATCCATAAA ACCAAACCAA TCAGACTAAA 2160

AAGATGGGAT AGAACCGGCA ACGTATCTAA AATAAGATTT CCAGCTCCTG CCAAACCTAG 2220

CAAACAACCT GAAAATACTA AGGGGAGTTT TTTCATCCTA ACCTCCAATA ATCATGTTAG 2280

TTTCAGTATA ACATAAAAGC GCTTAAATGA GGATTTAAAA AAACGAGTCC GCTTATTTCA 2340

GACTTCATTT TACTCAGATA TGAATTAGGC ATAAGGTTGC AATTCTGGAT TAATTGGTGT 2400

ATTAGCTAAG TTGTTGGCAT AGTTACAGAG GATTGCTAGG CTGACACCAA AAACCACATC 2460

CAAGGCATTT TGTTGAGTGT AGCCAGCTTC TAAAAACTCA GACAAGGCTT CATCTCCTAC 2520

ACGACCCTTG GTATTGATAA CTGCCAAGGT AAACTTAGCT AGGGTATCCA ATTTAGGATC 2580 

TGTTTCAATT GGAGTACGAT TGCGAAGAGC TTGAATCAAG TCATCATTCA TCTGGATTTG 2640

TTTGATGGAA AAGGCTGTGT GACCTGCGAC ACAGAAGGCA CAACCATTGG TCACGGCTGC 2700

CGTGATTTGC ACCACTTCAC GCTCAACGGG TGTCAGGCTG TTGCGACGGT GGATAGATGA 2760

GACAATTTGG TAGGCTTCTA AAACAGTCGG GGCATTGGCC AAGAGACCGA TTAGGTTGGG 2820

AATATAGCCA TTGTTGTCTT TTTCTACTGT TTCAAGAATT TCTTTCACTT CTGCTGGTGC 2880 

TGACTCTACT GTATGGATAG TAAATGTTGT CATAAGATAC CTCTTTTCTT ATTATTGACA 2940

CTAATATTAT TGGAAAATCT TATAAAATCC TGATTCCTAA GTTTATCTAA GATAAAGCTT 3000

TATTCTCTCA TAAGATTTTC GTTGTTATAT TAGTTTATCA CACTTCCAAT CACTTGTATA 3060 

ATATATATTA TATATCAGGC TGATAAAAAT TATTTATAGG CAAAAAAATC ACACGAGCTG 3120 

TGTGATTCCA TTATTTGTCA AAATACTTTT TAGTTTCAGC AATAACGACT GGCGACAAGA 3180

CCAAGAGGGC AATCAAGTTT GGCAGAGCCA TCAAGGCGTT AACGATATCT GCGATAATCC 3240 

AGACCATATC CAACTCGATA AATCCTCCTA ACAAGACCAT GAGCACAAAA ACCACACGGT 3300 

AGAGCCAGAT AAAGCGAACC CCAAAGAGGA ACTCAAAACA GCGTTCTCCG TAATAGTTCC 3360 

AACCTAGAAT CGTTGTAAAG GCAAAAAGTA CAAGGAAGAT GGTCAAGAGA GCAGGCCCAA 3420

AGTGTGAAAA GTTTGTTGAG AAAGCTGACT GAGTCAAGGC AACCCCATTC AAGTCACCGC 3480 

TCCAAACTCC AGTTACCAAG ATGGTCAAAC CAGTTAGAGT A 3521
(2) INFORMATION FOR SEQ ID NO: 119: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1968 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 

AACCTGGGCA AGCAAGCTAA AAGCAATGGG ACCTGGAATC CTAATGGCAA CTGCCGCTGT 60

TGGAGGTTCC CACATTGTAT CCTCAACTCA AGCTGGCGGT TCTTACGGTT GGTCTCTACT 120

TCTCTTGGTC ATCTTAGCCA ATGTCTTTAA ATATCCATTT TTCCGTTTTG GTGCTGAATA 180 

CACAGCTGAT ACTGGAAAGA CTTTGGTTGA AGGTTATGCC GAAAAAGGAA AACTCTATCT 240

CTGGATTTTC TTTATCCTCA ATGTCTTTTC GGCTATGGTC AACACGGCTG GTGTTGCCAT 300 

TCTGTGCTCA GCTATCATCG CCAGTGCCTT CCCAATGATT GGACTTAGCA TTACTCAGTG 360

GTCCCTCATT CTCGTTGCAA TCATTTGGGC TATGCTACTC TTTGGAGGCT ACAAACTTTT 420

AGACGGCATG GTCAAATGGA TTATGTCTGC CTTAACCATT GCGACTGTTC TTGCAGTTAT 480

CATTGCGGCG GTCAAGCATC CAGAATACAG TTCTGATTTT GTCGAGAAGA CACCTTGGCA 540

AATGGCAGCT CTGCCCTTCA TCGTCTCCCT CCTAGGATGG ATGCCGGCTC CTATTGAAAT 600 

TTCAGCCATC AATTCACTTT GGTCAGCTGA AAAGAGAAAG ACCGTCAACT TTAACACAGA 660 

AGACGCTCTG TTTGACTTTA ACACTGGTTA TATTGGAACA GCTATCCTAG CCGTCTTCTT 720 

TGTGGCACTG GGAGCACTGA TTCAGTATCC TACAGGGCAG GCGGTTGAAG CTGCTTCAGC 780 

CAAATACATC TCTCAATTCG TGGGCATGTA TGCCTCTGTT CTTGGCGAAT GGTCCCGTTA 840 

CTTGATTACC TTTATTGCCT TCCTCTGTAT CTTTGGAACA GTTATAACTG TTATCGATGG 900 

CTATTCTCGC GTTAATCAGG AATCTCTCCG ACTGCTAATC AGTCAAAAAG AGGACAATCG 960 

TAAATCTTTG AACATCTGGA TGACCATCAC TGCTATCATC GGTATCGTCA TTATCAAGTT 1020 

CTTCGCTGGT CAGGTTTCAA CCATGCTCCG CTTTGCCATG ATTGGCTCTT TCCTGACAAC 1080 

ACCTTTCTTT GCTCTTTTGA ATTACGCCTT GGTAACGCGT GAAAACAAAA ATCTTCCTTC 1140 

TTGGCTCAAA CACCTTGCCA TTGCGGGATT GATTTTCCTC TTTGCTTCGC CATCTTCTTT 1200 

ATCTACGCAC TCGCAATCGG AAAAGCAGGG TAAGGGACAA GCGCGAGATG AAGATAAGGT 1260

TTCATTTCAA GAGAAAATTC AGCAAATATT TCTATGATAA AAAGCATAAG AACAAGGTTT 1320 

TGAAGACCTG AACTTATGCT TTTTTACGTT CTTAAAGACT GTTTATACTC AAAAAACAGT 1380 

TGAACAACTT CAACCACCTC TTATAAGAAC TTTATACTAT TCGAGAATCT CTTCAAACCA 1440 
CGTCAGCTCT ATCTGCAACC TCAAAGCTGT GCTTTGAGCA ACCTGCGACT AGCTTCCTAG 1500

TTTGCTCTTT GATTTTCATT GAGTATTAAT TCTCCTTTTC CAACTCATAC AAATCTGCGA 1560 

TAATAGCTGC GACATGTTTG ATATCTTCCA GCATGCCTCG CATTTCAAAG TCAGCCAATA 1620

CAGGGAAGCC AAAGCGTTGA CTGTATTGCT TGGCTGTTAG GCAGTATTGG TTATTAAAGT 1680 

TACGATTTCC TGACCCAACC ACACCAAAAC ACTTACTAGC ATTGTTACCA TAGGCAATAA 1740 

AATCTCCCAC CGGTGTCGTC AAAATCTCAA CATCTCCGTT ATCCACGCCA TTCCCACCTT 1800 

CGAGATAGGT CGGCAAAAAA GCGACATAGG GATGGTCCAT TTCATAGAAA TTTTTGCCTT 1860 

CCTTGACCAA ATCCTTGATA TGAATCTTTT GAACCTCAAT CCCTTTGTAC TGGGACAAGA 1920

GATAGTCTTT CAAGCGCGTC ACAAAACTTT CAGTGTTGCC ACTCAAGG 1968 
(2) INFORMATION FOR SEQ ID NO: 120:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 7172 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

CCGCATTTTT TATCACTAGA CTCGAGACAT CTTTTGAGTG GCTCTTGCTC TCTGGTTTAA 60

TTTTCTTCCT TGCTCAAGGA CTCCTGCTAT TTCTCTTGGT CGTCCGACTC AAACATCAAT 120

TCGCTGAGAT TTATCCTCAA ATCAATAAAA AGATTCGCTT CTACTATTTA GGGGTTCTCA 180

CCATTGATTT TCTATTTTTT GTTCTCTTAG CCTTCATTAG TTCTCAGCGT TTTTCATCTC 240 

TTATGCCAAT CATCACTGCT TGCCATTCTA CTTTTTATTA TATGACAGCT GACTACCTAA 300 

GAGAAAACTA TCCAGACTTT TACGACAAAC ACATCTCTTT ATGGGAGTGT CTCTAAAGAA 360 

AAGGAGGTTT TAGCATGAAA AAAATCATCT TCATCAAAAC CATTCAACTC CTTGTCATTG 420

ATGGAATCAT GCTGGCATTT TTGACATTTA AAAGGGGGCT TACTTGGGAC TGGATTTTGA 480

TTTATAGCGG TTGGCTCATT TTCTTTCATC CTGTGCTATT GACCTATCTT TCAAACCAAC 540

TTTGTGACCA CTTTAGTTAA CTCTATTCCC AGATTAGACC GAGATTCTGG CGTTTTGCTT 600 

TACAAATTCT CCTATGGGAT AGCCTGATGA TTCTCTCCTT GGTGTCTTTA AGTGATATTC 660 

CACTTTTCCT TCAGGGAACT CTCCTCATCC TAGGACATCT CATCCCTTCC TATCGCATCT 720

GCCAAAGCCT GAAAAGAGAC TTCCCCCAAG CATATCAAGA ACCGATTTCT TTTTGGAGTA 780

TTTTATGATA GATGAGAAAG ACCAAGCCGA CTGGGCTTGG TCTTTCTTAT CTCTTTTTAG 840

TATCTAGGAT AATGGTAACA GGTCCATTAT TAACCAGCTC AACCTGCATA TCTGCTCCAA 900
AGATGCCTGT CTGAACGGGC ACTTCTTGCG CTAATTTTTG ATTGAAAGCA TCATAGAAGT 960

CTGATGCCAT ATCAGGTTTA GCTGCCCCTG TAAAGgCTGG ACGATTGCCT CTCTTAGTAT 1020 

CCGCAAAGAG GGTAAACTGA GAAATAGAGA GGATTTCTCC TTCAATATCT TTGACAGACA 1080 

GGTTCATCTT GCCTTCTGCG TCTGAAAAAA TCCGCATATT GACCAGTTTT CTCACAGCAT 1140 

AGTCCAAATC TTCCTCTTGG TCCTCTGGTC CAACACCAAC CAGCAATAAA AGTCCCTGAT 1200

TGATTTTTCC CTGAATCTGG CCTTCTATAC TCACTTGGGC TTTTTTAACC CGTTGGATAA 1260

TGATTTTCAT AATAGCCTTT CTAGTAAGAG CTAGGACAAC TAGCCGTTGG TCCGTTTGAC 1320 

AGAGTAAACT TCTGGCACAC TCTTAATTTT ATCGACAACC GTGGTCAGTG TAGAGAGGTT 1380

GGCAATACCG AAGgACACAT GGATATTAGC AAACTTCATA TCCTTGGTTG GTTGGGCATT 1440 

GACCGTTGAA ATATTCTTGG TTGTATTTGA AAGAACTTGC AGTACATCGT TCAACAGTCC 1500

TGTACGGTTG AGACCGTAGA TATCGATATG GGCCATATAC TCCTTATTTG AGCTAGGGTA 1560

CTGGTCTTCC CATTCCACAT CAAGGAGACG TTGCTCGTAG TTTTCTTGGG CACGCAGGTT 1620 

CATACAGTCC ACACGGTGAA TAGCCACACC ACGACCCTTG GTAATGTAGC CAACAATATC 1680 

GTCACCAGGC ACGGGGTTAC AACACTTAGC AATCCGCACT AGGAGACCAG AAGCACCTTC 1740 

AATAACCACT CCCCCCTCAT GCTTGACCTT GAGGGTTTCT TTATTTTCAA CCTTGACCTC 1800 

GCCACCTTTG ACAAGCTCCT CTGCCTCAGC TTTGGCCTTG GCACGCTCTT CCTCACGGCG 1860

TTCCTTTTCA GTCAGACGGT TAAAGACGGT AATCGCACCG ATTTCCCCAA AACCAATGGC 1920 

CGCAAAGAGG GAGTCTTCTG TCTTGTAACT GGTCTTTTGC AGAACTTGAT CCATGTGGCG 1980 

CTTGTCCATA AATTTATTTG CCACATAGCC ATTTTCTTGG AACTGAGCCA TCAGCATCTC 2040 

ACGACCCTTG TTGACAGACA ATTCCTTATC TTGGTTTTTA AAGAACTGGC GAATCTTATT 2100 

GCGCGCCTTG CTAGTCTTGA CCATATTGAG CCAGTCACGG CTAGGTCCAA AGGAGTTCGG 2160 

GTTGGCGATA ATTTCAACCT GATCCCCTGT CTTTAACTTG GTTGTCAGTG GAACCATGCG 2220 

GCCATTGACC TTGGCACCAG TTGCTTTTTC ACCGACCTTG GTATGGATTT CGTAGGCAAA 2280

ATCAATCGGT CCTGAATCTT TGGGAAGGGA ACGGACAGCT CCATCTGGGG TAAAAACGTA 2340

AATCTCCTCA GCCAAATAGT TTTCCTTAAC AGAGTCCACA AATTCCTTAG CATCATCAGC 2400

CTGGTCTTGG AGCTCCATCA TCTCCTTGAT CCAGTTCATT CCAATAGCTG ATTCCTTGCT 2460

GTTAACTTGC CCCTTTATAC CTTTCTTATA AGCCCAGTGA GCCGCAACCC CGTACTCAGC 2520

CACCTCGTGC ATTTCCTTGG TTCGAATCTG GAATTCAATC GGCCCTTTTG GTCCATAAAC 2580 

AGTCGTATGG ATAGACTGAT AACCATTGGC CTTGCGGTTG GCGATATAGT CTTTGAAGCG 2640
ACCTGGCATC GGTTTCCAAA ATTCATGCAC GTAACCAAGC ATGGCATAAA CATCACTTTG 2700

GGTATCTAAA ATACAACGAA TAGCAATCAG ATCATAGATT TCCTCAAACC GTTTTCTCTT 2760

GTCCTGCATT TTGCGGAAAA TTGAGTAAAT ATGCTTGGGA CGACCATAAA TCTTCCCTTT 2820

CAAGTGACGT TCTGTCGTAT ACTCCTCTAA TTTTGTGACT ACCTCATCCA CCAAGGCCTC 2880

ACGCTCCCTG CGCTTTTCCT TCATCATATG GGTAATCTTG TAAAACTCCG TTGGATTGAG 2940

ATAACGGAAA GACAAGTCTT CTAATTCCCA TTTGACACTG GAAATCCCCA AACGATGGGC 3000

AAGCGGGGCA TAGATTTCCA TGGTTTCTTT GGAAATACGC TCCTGCTTGT CTTTTCGAAG 3060

ATGTTTCAGG GTCCGCATAT TGTGCAAGCG GTCAGACAGT TTGACCAAAA TAACGCGGAT 3120 

GTCCTCAGAC ATGGCCATGA GCATCTTGCG ATGATTTTCC GCTAATTGCT CCTCGATCGA 3180 

TTTGTACTCG ACCTTGCCAA GCTTGGTAAC TCCGTCAACA ATCATCCGCA CATCAGGACC 3240 

AAACTCTCTT TCCAAATCGT CCAAAGTCGC ATCTGTATCT TCCACCACAT CATGCAAGAA 3300

TCCACAAGCT ACTGTTACAG CATCCAGCTT TAGCTTAGCT AAAATACCTG CCACTTGGAT 3360

AGGGTGAATG ATATAAGGCT CGCCTGATTT GCGATATTGA CCACTGTGGC ATTCAACAGC 3420

ATAGACCAAG GCCTTATGGA CAAAATGAAC ATCCTCTTCC GTTAAATATT CTTTGGTTAA 3480

AGCGACAACT TCTTCGCCTG TTAAATTCAC TTCTTTCGGC ATCTCTACTC TCCAATTCTT 3540

CCTACCATTT TATCACTTTT TTAAGAATAT GAAAACTAGA TTGGAACAGA ATAAGAAAAA 3600 

AATAATTCAA AATTGCTTGA TAATTCTGAA TTATTGGTCC GTAATATACT ACGAAGTTAG 36 60 

ATTTTAAACT TAGGTGATAG AAGGAGAGAT AGAAGAACGG AAACCATATT GTAACCCAAA 37 20 

GACTTTCTGA CTTCCCCAAT TCCATTGAAG ATACGAAAGA TAAACGGTGG AACTCGTATC 37 80 

ACATACACTG GTACCTTGAC TGGATTTTGG AATTAATACT AAATGAAAAT CAAAGAGCAA 3 84 0 

ACTAGGAAAC TAGCCGCAGG TTACTCAAAG CACCGCTTTG AGGTTGCAGA TAAAGTTGAC 3900 

GCGGTTTGAA GAGATTTTTG AAGAGTATAA AAATCCTCAA GATACTTTCT TCTATCCTTT 3 9 60 

AGTTTATAAG GAGAATACCT ATGAAAAAAA CTGCTATTTC TATCTTTGCT CTCCTAATGT 402 0 

TAGGAGTTTG CTGCCTGTTC CTATTCAGCC AGCAAAGCTA TAAAAAACAG TCGTTCAATA 4080 

CTATGCTAAC GACCAGAACC TGCCCAGTAG GATAACTTAT AGTGAATATA GCGACAAATG 414 0 

AGAAGCCAAC TACGGTAGCA CTCTAAACAT CACGTCTATC AAACAAGCTA ATGACGGAGT 4200 

TTATGCAACC TATGAAGGGC AATTGACACC TTTCCAATAT TGATAAATTG ATAACCAGCC 4260 

TGTCTTCATC TAGTCATGCT GGTTTTTAAG TTCATTTTAA ATCCTTACCT ATTCTCCCTA 432 0 

ACTGTGCTAT ACTTAATTTA TACTCAATGA AAATCAAAGA GCAAACTAGA AAGCTAGCCG 4380 

CAGGCTGTTC AAAGCACTGC TTTGAGGTTG CAGATAAAGT TGACGCGGTT TGAAGAGATT 444 0 
TTCGAAGAGT ATTAGTACAT TCTTTGAGAT TGGAGCTAGT ATGAAAATCC ATAAAACCGT 4500

GAATCCTGTT GCCTATGAAA ATACCTATTA TCTAGAAGGC GAAAAGCACC TCATCGTCGT 4560 

CGATCCTGGT AGTCATTGGG AAGCCATTCG TCAGACAATC GAGAAGATCA ACAAACCGAT 462 0 

CTGTGCTATT CTCTTGACCC ACGCCCATTA TGACCATATC ATGAGTCTGG ACTTGGTTCG 4680 

CGAGACGTTT GGCAATCCTC CTGTCTATAT CGCAGAGAGC GAAGCCAGCT GGCTCTACAC 474 0 

TCCTGTCGAT AATCTCTCCG GTCTCCCTCG CCACGATGAT ATGGCAGATG TGGTCACAAA 4800 

ACCTGCAGAA CACACCTTTG TCTTTCACGA AGAATACCAA CTAGAGGAAT TTCGTTTTAA 4860 

GGTTCTACCG ACCCCAGGGC ACTCTATCGG TGGTGTTTCC CTAGTCTTTC CTGATGCTCA 492 0 

TCTAGTCTTG ACGGGAGATG CTCTATTCCG CGAAACTATC GGACGGACCG ACCTTCCGAC 4980 

TGGTAGCATG GAGCAACTCC TTCATAGTAT CCAGACCCAA CTCTTCACCC TACCAAACTA 504 0 

CGATGTCTAT CCAGGACATG GTCCAGCTAC TACTATCGCT CACGAAAAGG CCTTCAATCC 5100 

CTTTTTCTAG CAAGATGATG ACAATCGAAA TTTAAGTAAA CTATCCAGCA AATCTTTCTA 5160 

TTACAAAAGG CATCCTATCA AGGTTTTCAC ACATGATTGG ATGCCTTTTT TCTGATGACT 5220 

AGATTTTTTG CATTACCAAA TAATCACGCG CTCCTCTGGT GAACGCCACA TTCCGTCTCC 5280 

TTCTTTGACA TCATAGGTTG TAAAGAAATC GTCGAACTTT GGTACTTGCA CATTGACACG 5340 

GAGTTTGGCT GGTGCGTGCA CATCGACGCT AGCCAAAAGT TTCATAAATT CTGGTCGACC 5400 

TTTCATGCGC CAGATGCGAC CGAAGTTGTA GAAGAACTCT TCTGCTGAGA AGTCTGCTTC 5460 

TCTCTTAGCT GCTTCAAGCG CTGCTGCGAT TCCTCCCAAG TCAGCCACGT TTTCTGATAC 5520 

AGTCAATTTA CCGTTAATGG TTGCTCCATA AGAATCCTGT CCATCAAATT GGTCAATGAC 5 580 

TTTTTGTGTT TTCTCCTTGA AGGCAGCATA GTCGCTCTCT GTCCACCAAT CCTTGAGGCT 5 640 

ACCATTTTCG TCAAAGGAAG CCCCGTTAGT ATCAAAGGCG TGGGAAATTT CATGGGCAAT 57 00 

CACTGCCCCA ATACCACCGT AGTTAGCAGA AGATGACTGA TGCAAGTCAT AGAAAGGCGC 57 60 

CTGTAAAATG GCCGCTGGAA AGACAATCAG GTTCTTCTGA GGATTGTAGT AGGCATTGAC 5820 

CATATGAGCA GGCATGCCCC ATTCCTTATA ATCTACAGGC TGGTTCCACT TACTCCAACT 5 8 80 

GTGCTTGATT TCCACACGCG CAAAGGCTAG AGCATTCTCA AAAAGACTGG CAGTTTCATT 5940 

CACTACCTTA TCCTTGTAAC GTGCAGGCAA TTCTTCTGGA TAGCCAATAT AAGGTTTGAT 6000 

CACATTGAGC TTCACGATAG CCTGTTTACA GGTTTCTGGA GTGAGCCAGT CATTCTTAAG 6060 

CAGACGCTCC TTATAAACAT CAATCATGGT TGCCACTTTT TTCTCCACAT CCGCCTTGGC 6120 

TTCTGGAGAG AACTTCTCAC GGGCGTACCA AAGACCCAGG GCTTGCTTGA AAGGTTCTTG 6180 
TGCTAGATGA TAAGCTGCTT TGACCTTATC TTTTGCCTCT GGAACTCCAG AAAGGGCACG 6240

GCTGTAGGCA CCAGACAAAA CACGGATATC CTCTGTTAAA TAGCTGGTTG AAAGATTGAC 63 00 

AACACTCAAA ATCAAGGTTG CTTTAAGGAG AGACCAGGCT TCCTCACTGT AGAATTGCTC 6360 

TGCTGCTTGC CAGAAACGTT CCTCGTCTAC AATAACCTTG TCTGGTAATT GCCCAATAAC 6420 

TGCTTTGAAG AAGTCATCCA AAGGTAGGGC AGGCGCGAAT TTCTTGAAAT CTTCGTAAGA 6480 

ATATGGATGA TAGAGTTTAG CATATTCTGA ACTTTCTTCA TTAGAGAGCA CCACTGCCGC 6540 

AACTCGGCGG TCCAATTCAA GTCTTTTTTC TAGCAAGTCT TCAATTTCTT CATCAGAGAA 6 600 

ATCATAAGCC TTGAGGAGAT TTGCGCTGCT TTCTTTCCAA AGAGTCAAGA GCTCTTCGCG 6660 

CTGAGGATGT TCTTCTGCAT AGTAGGTCGT ATCTGGCAAG ATTGTGCTTG GAGCGCTAGC 6720 

CCATAGAACA TTGATTCTAG CATCCATAAA GTCTGGCGAT ACACCAAAAG GAAGGAAGTT 6780 

TGGTTTTCCT GCAAGCTCAA ACTCTGCTAG TTTAGCTGTA AAATCCGCAA AAGTCTCCAA 6 840 

TTCTTGGAAT TCTTTAAGGA GTGGTAAGAC AGGTGTGATA CCGTCAGCTT CTCTCTTGTC 6900 

AAAATCACGA ACTAGGCGGT GGTATTTGAC AAAGTTTTCC AAGATAGCAT CCTCAGGCAC 6 960 

TTCTTCACCT GCTAACCACT TGTCTGTTGT CGCCAGCATC AGGTCTTCAA TTTCCTGGTC 702 0 

TAAATCAACA AAACCTCCTG TTTGAGACTT ATCTGCTGGG ATTTCAGCTG TCTGTTGCCA 7080 

TTCTCCATTG ATAGCATCAT AAAAATCATC TTGATAACGT GTCATCTTGT TCTCGCTTTC 7140 

ATTTGTATTT GCATTTATCT TAACAAAAAT CG 7172 
(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4518 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

CGGGAAGTTA TGCGATCTAG ACTTCGTTCC TGTACAGCTA CTTTCTCAGG TGGTCTTGTT 60 

GTTTGTATGA GTTTGTTTAG AGAGGATCTT TCTATGTCTT TCTTTCTTAT TTTTGTTTTA 12 0 

TATGCTTTTC TGATTTCTTA TCTAATTTAT GGTTATTTCA GACTAAAAAG GAAATACCGA 18 0 

GTAGATGAAT AGCAAGGTTC TAGGTCTTCA GATTGATTTT TAGCACTCTT GATAAAAGAG 240 

TGCTAATTTT TTGAGTTTTT GTCTTGACAT TCTCTTCTAA GGGTGTATAA TAGAATCATG 300 

AGTTAGCACT TGGATGCATT GAGTGCTAAT TGATCAGACA GAGAGGAGTG ATGAGATGGT 3 60 

TACAGAGCGT CAGCAGGATA TTTTAAATCT GATTATTGAC ATCTTTACCA AAACGCACGA 42 0 
ACCTGTCGGA TCAAAAGCCT TGCAAGAGTC TATTAACTCT AGCAGTGCAA CCATTCGTAA 480

TGACATGGCG GAACTAGAAA AACAAGGGTT GCTTGAGAAG GCTCATACTT CAAGTGGTCG 540 

GATGCCAAGT GTTGCTGGTT TTCAGTACTA TGTGAAACAC TCACTGGATT TTGACCGGCT 600 

GGCTGAAAAT GAGGTATATG AGATTGTCAA AGCCTTTGAT CAGGAATTCT TCAAATTGGA 6 60 

GGATATTCTG CAAGAGGCTG CTAACTTACT AACAGACCTG AGTGGCTGTA CGGTAGTGGC 72 0 

ACTGGATGTT GAGCCGAGCA GGCAACGTTT GACAGCCTTT GATATCGTTG TTTTGGGGCA 7 80 

ACATACAGCC TTGGCGGTAT TTACCCTAGA CGAGTCGCGA ACGGTTACTA GTCAGTTTCT 840 

GATTCCAAGG AACTTCTTGC AGGAGGATTT GCTGAAACTG AAGAGCATCA TTCAGGAACG 900 

TTTCCTCGGT CACACCGTTT TAGATATTCA CTACAAGATT CGGACGGAGA TTCCGCAGAT 960 

TATCCAGCGT TACTTTACAA CAACGGATAA TGTCATCGAT CTCTTTGAAC ACATCTTTAA 102 0 

GGAAATGTTC AACGAAAACA TTGTGATGGC GGGCAAGGTC CATCTCTTGA ATTTTGCCAA 1080 

TCTAGCAGCC TATCAGTTCT TTGACCAACC GCAAAAGGTG GCCTTGGAGA TTCGTGAGGG 114 0 

GTTGCGTGAG GATCAGATGC AAAATGTTCG TGTTGCAGAC GGTCAAGAGT CCTGTTTAGC 12 00 

TGACCTAGCG GTAATCAGTA GTAAGTTCCT CATTCCTTAT CGGGGAGTTG GAATTCTAGC 12 60 

CATTATCGGT CCAGTTAATC TGGATTACCA ACAGCTAATC AATCAAGTCA ATGTGGTCAA 132 0 

CCGTGTTTTG ACCATGAAGT TGACAGATTT TTACCGCTAC CTCAGCAGTA ATCATTACGA 1380 

AGTACATTAA GATTGAAATC ATTAAAGGAG GCGAACATGG CCCAAGATAT AAAAAATGAA 1440 

GAAGTAGAAG AAGTTCAAGA AGAGGAAGTT GTGAAAACAG CTGAAGAAAC AACTCCTGAA 1500 

AAGTCTGAGT TGGACTTGGC AAATGAACGT GCAGATGAGT TCGAAAACAA ATATCTTCGC 1560 

GCTCATGCAG AAATGCAAAA TATCCAACGC CGTGCCAATG AAGAACGTCA AAACTTGCAA 162 0 

CGTTATCGTA GCCAGGACTT GGCAAAAGCA ATCTTACCAT CTCTTGACAA CCTTGAGCGT 1680 

GCACTTGCAG TTGAAGGTTT GACAGATGAT GTGAAGAAGG GCTTGGGGAT GGTGCAAGAA 1740 

AGCTTGATTC ACGCTTTGAA AGAAGAAGGA ATTGAAGAAA TCGCAGCAGA TGGCGAATTT 1800 

GACCATAACT ACCATATGGC CATCCAAACT CTCCCAGCAG ACGATGAACA CCCAGTAGAT 1860 

ACCATCGCTC AAGTCTTTCA AAAAGGCTAC AAACTCCATG ACCGCATCCT ACGCCCAGCA 192 0 

ATGGTAGTGG TGTATAACTA AGATATAAAG CCCGTAAAAA GCTCGCAGTA AAAATAGGAG 1980 

ATTGACGAAG TGTTCGATGA ACACAAGAAA ATCTATCTTT TTTACTCAGA GCTTAGGGCG 2040 

TGTTCGATTC GGCAATTCTG ACGGTAGCTA AAGCAACTCG TCAGAAAACG GCAATCGCTA 2100 

TGGCGTTTGC CTAGCTTCCT TACTAACTCG TCGTCGAAAT AAAATCGATT TCGACTCCTC 2160 
GTGTCGCAAT TTACATAATA GAAAACTTGT CCGAAACGAC AATAAACTAT GAAGAAAGAT 2220

AAAATATGTT TGGCTTTGTA ATAGTGAGCG AAGCGAACCA AACACGATAC TCTTCGCCGT 2 2 80 

GGCGCTATTT GCGCAAATTT TGAGACCTTA GGCTCAAAGT TTAGTCAAAG AGATTGACGA 23 40 

AGTCAAGCTC TGACGGCGTC GCCACTGTCG CCACTTAAGA AGAGTATCAA AAAGAAAAAT 24 00 

AGAAAATTAA CTAACAAGGA GAAAAACACA TGTCTAAAAT TATCGGTATT GACTTAGGTA 24 60 

CAACAAACTC AGCAGTTGCA GTTCTTGAAG GAACTGAAAG CAAAATCATC GCAAACCCAG 2 520 

AAGGAAACCG CACAACTCCA TCTGTAGTCT CATTCAAAAA CGGAGAAATC ATCGTTGGTG 2 5 80 

ATGCTGCAAA ACGTCAAGCA GTTACAAACC CAGATACAGT TATCTCTATC AAATCTAAGA 2 640 

TGGGAACTTC TGAAAAAGTT TCTGCAAATG GAAAAGAATA CACTCCACAA GAAATCTCAG 27 00 

CTATGATCCT TCAATACTTG AAAGGCTACG CTGAAGACTA CCTTGGTGAG AAAGTAACCA 27 60 

AAGCTGTTAT CACAGTTCCG GCTTACTTCA ACGACGCTCA ACGTCAAGCA ACAAAAGACG 2 82 0 

CTGGTAAAAT TGCTGGTCTT GAAGTAGAAC GTATTGTTAA CGAACCAACT GCAGCAGCTC 2 880 

TTGCTTATGG TTTGGACAAG ACTGACAAAG AAGAAAAAAT CTTGGTATTT GACCTTGGTG 2 94 0 

GTGGTACATT CGACGTCTCT ATCCTTGAAT TGGGTGACGG TGTCTTCGAC GTATTGTCAA 3 000 

CTGCAGGGGA CAACAAACTT GGTGGTGACG ACTTTGACCA AAAAATCATT GACCACTTGG 3060 

TAGCAGAATT CAAGAAAGAA AACGGTATCG ACTTGTCTAC TGACAAGATG GCAATGCAAC 312 0 

GTTTGAAAGA TGCGGCTGAA AAAGCGAAGA AAGACCTTTC TGGTGTAACT TCAACACAAA 3180 

TCAGCTTGCC ATTTATCACT GCAGGTGAGG CTGGACCTCT TCACTTGGAA ATGACTTTGA 324 0 

CTCGTGCGAA ATTTGACGAT TTGACTCGTG ACCTTGTTGA ACGTACAAAA GTTCCAGTTC 3 3 00 

GTCAAGCCCT TTCAGATGCA GGTTTGAGCT TGTCAGAAAT CGACGAAGTT ATCCTTGTTG 33 60 

GTGGTTCAAC TCGTATCCCT GCCGTTGTTG AAGCTGTTAA AGCTGAAACT GGTAAAGAAC 34 2 0 

CAAACAAATC AGTAAACCCT GATGAAGTAG TTGCTATGGG TGCGGCTATC CAAGGTGGTG 3480 

TGATTACTGG TGATGTCAAG GACGTTGTCC TTCTTGATGT AACGCCATTG TCACTTGGTA 3 540 

TCGAAACAAT GGGTGGAGTA TTTACAAAAC TTATCGATCG CAACACTACA ATCCCAACAT 3 600 

CTAAATCACA AGTCTTCTCA ACAGCAGCAG ACAACCAACC AGCCGTTGAT ATCCACGTTC 3 660 

TTCAAGGTGA ACGCCCAATG GCAGCAGATA ACAAGACTCT TGGACGCTTC CAATTGACTG 372 0 

ATATCCCAGC TGCACCTCGT GGAATTCCTC AAATCGAAGT AACATTTGAC ATCGACAAGA 37 80 

ACGGTATCGT GTCTGTTAAG GCCAAAGACC TTGGAACTCA AAAAGAACAA ACTATTGTCA 3840 

TCCAATCGAA CTCAGGTTTG ACTGACGAAG AAATCGACCG CATGATGAAA GATGCAGAAG 3900 

CAAACGCTGA AGCCGATAAG AAACGTAAAG AAGAAGTAGA CCTTCGTAAT GAAGTAGACC 3960 
AAGCAATCTT TGCGACTGAA AAGACAATCA AGGAAACTGA AGGTAAAGGC TTCGACGCAG 4020

AACGTGACGC TGCCCAAGCT GCCCTTGATG ACCTTAAGAA AGCTCAAGAA GACAACAACT 4080 

TGGACGACAT GAAAACAAAA CTTGAAGCAT TGAACGAAAA AGCTCAAGGA CTTGCTGTTA 414 0 

AACTCTACGA ACAAGCCGCA GCAGCGCAAC AAGCTCAAGA AGGAGCAGAA GGCGCACAAG 42 00 

CAACAGGGAA CGCAGGCGAT GACGTCGTAG ACGGAGAGTT TACGGAAAAG TAAGATGAGT 42 60 

GTATTGGATG AAGAGTATCT AAAAAATACA CGAAAAGTTT ATAATGATTT TTGTAATCAA 4 32 0 

GCTGATAACT ATAGAACATC AAAAGATTTT ATTGATAATA TTCCAATAGA ATATTTAGCT 4 3 80 

AGATATAGAG AATTATATTA GCTGAACATG ATAGTTGTAT CAAAAATGAT GAAGCGGTAA 444 0 

GGAATTTTGT TACCTCAGTA TTGTTGTCTG CATTTGTATC GGCGATGGTA CCGTATCTGA 4500 

CGAACGTTCA GCTTATAT 4518 
(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8145 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

TGCTATTTTC GATTCCCTTG GGCGTTTTGA TTGCCTTTGC CTTGCAAGTC CATTGGAAGC 60 

CCCTCCATTA TCTGATTAAC ATTTACATCT GGGTTATGCG AGGAACCCCC TTACTCTTGC 12 0 

AACTGATTTT TATCTATTAT GTGCTCCCAA GTATTGGGAT TCGTTTAGAC CGCCTTCCTG 180 

CAGCTATTAT TGCCTTTGTT CTCAACTATG CAGCTTACTT TGCAGAAATT TTCCGTGGGG 24 0 

GAATTGACAC TATTCCAAGA GGACAGTATG AGGCCGCCAA GGTCTTGAAG TTTAGCCCTT 3 00 

TTGACAGAGT GCGCTATATT ATCTTGCCCC AAGTGACCAA GATCGTTCTT CCTAGTGTCT 3 60 

TTAATGAAGT TATGAGTTTG GTCAAGGATA CTTCTTTGGT CTATGCTCTC GGAATTTCAG 420 

ACCTTATCTT GGCTAGTCGA ACAGCTGCTA ACCGCGATGC TAGTCTAGTT CCTATGTTCT 480 

TGGCAGGAGC CATTTATTTG ATTTTGATTG GGATTGTGAC AATTATTTCC AAAAAAGTTG 540 

AGAAGAAGTA TAGTTATTAT AGATAGGAGG CTGCCATGTT AGAATTACGA AATATCAATA 600 

AAGTCTTTGG AGACAAACAA ATCCTGTCTA ATTTCAGTCT AAGTATTCCT GAAAAGCAAA 6 60 

TCCTGGCTAT CGTTGGACCT TCTGGTGGAG GTAAGACAAC TCTTTTACGT ATGCTTGCAG 72 0 

GTCTTGAAAC CATTGATTCA GGGCAAATCT TTTATAATGG ACAACCTTTA GAGCTGGATG 780 
AATTGCAGAA GCGCAATCTA CTGGGATTTG TCTTCCAAGA TTTTCAACTA TTTCCTCATC 840

TATCAGTTCT GGAAAATTTG ACTTTATCGC CTGTGAAGAC CATGGGAATG AAGCAGGAAG 900 

AGGCTGAGAA GAAGGCGAGT GGACTCTTGG AACAGTTAGG ACTAGGAGGA CACGCAGAGG 960 

CCTATCCTTT CTCACTATCT GGTGGGCAAA AGCAGCGGGT GGCTTTGGCG CGTGCTATGA 1020 

TGATTGACCC AGAAATCATT GGCTACGATG AACCAACTTC TGCCCTGGAT CCAGAATTAC 1080 

GTTTGGAAGT GGAGAAGCTA ATCTTGCAAA ATAGGGAACT TGGGATGACC CAGATTGTGG 114 0 

TTACCCATGA TTTGCAGTTT GCTGAAAATA TCGCAGATGT ATTATTGAAA GTAGAACCTA 1200 

AATAGGAGGA AAAATGGATG AAAAAATGGA TGCTTGTATT AGTCAGTCTG ATGACTGCTT 12 60 

TGTTCTTAGT AGCTTGTGGG AAAAATTCTA GCGAAACTAG TGGAGATAAT TGGTCAAAGT 132 0 

ACCAGTCTAA CAAGTCTATT ACTATTGGAT TTGATAGTAC TTTTGTTCCA ATGGGATTTG 1380 

CTCAGAAAGA TGGTTCTTAT GCAGGATTTG ATATTGATTT AGCTACAGCT GTTTTTGAAA 1440 

AATACGGAAT CACGGTAAAT TGGCAACCGA TTGATTGGGA TTTGAAAGAA GCTGAATTGA 1500 

CAAAAGGAAC GATTGATCTG ATTTGGAATG GCTATTCCGC TACAGACGAA CGCCGTGAAA 1560 

AGGTGGCTTT CAGTAACTCA TATATGAAGA ATGAGCAGGT ATTGGTTACG AAGAAATCAT 1620 

CTGGTATCAC GACTGCAAAG GATATGACTG GAAAGACATT AGGAGCTCAA GCTGGTTCAT 1680 

CTGGTTATGC GGACTTTGAA GCAAATCCAG AAATTTTGAA GAATATTGTC GCTAATAAGG 1740 

AAGCGAATCA ATACCAAACC TTTAATGAAG CCTTGATTGA TTTGAAAAAC GATCGAATTG 1800

ATGGTCTATT GATTGACCGT GTCTATGCAA ACTATTATTT AGAAGCAGAA GGTGTTTTAA 1860 

ACGATTATAA TGTCTTTACA GTTGGACTAG AAACAGAAGC TTTTGCGGTT GGAGCCCGTA 1920 

AGGAAGATAC AAACTTGGTT AAGAAGATAA ATGAAGCTTT TTCTAGTCTT TACAAGGACG 1980 

GCAAGTTCCA AGAAATCAGC CAAAAATGGT TTGGAGAAGA TGTAGCAACC AAAGAAGTAA 2 040 

AAGAAGGACA GTAAGATAAA ATAGTGGCTG AAACTGCGTT TTGATTAGCA AAACGTAGTT 2100 

TTTTTTGTAA TCTAGGAAAA CGATAATAGC GATTGAATAT GGATAATTGA ATATGGAATA 2160 

GCCCACTGTG ATTTCTAAAA CATTGTTAAA AATTGATTTG ACTTCCAAAA TTAAAATGTT 2 220 

CTGTAATGAA ATACTGATGT AACTGTTTTA GGAACAATAA AACGCATAAT ATCAAGGTTT 2 280 

TTGCACCTTA CATTATGCGT TTTTGTGATT TTAAGACTTG TTAGCTGATT TTTTACAATC 2 340 

CTGCGAAATC TTTGATTTCT TGTGCTGACA TTGAAGAGTC GCAACGGACG TTGATTTGTC 2400 

CATCTGTAAT ATGAACAAAA CCTGGTACAG TTGGGATTCC ATAGCGTGAG CGGAATGCTT 2460 

GCAAATCATT GAGTTGGCTT GGTTCTTCAC TATTGATGAA GTAAATGTGA GCTTTGGTTT 2520 

CAGCTACGAC ACCTGACAAT GTACCTGCAA ATTTACGGCA GTAAGGGCAA GTTTTGCGAC 2 580 
CGATAAAGAA GGTTGCAGTT TCTTTTTTAT CAAGAGCTTC TTGCGCACGC ACAACTGTAG 2640

TGACTTCAAG GTCTTTGATG TTATCTAAAA ATTGTTCCAT GAGATTACCT CGCTTTCATT 2700 

GATAAGTCTA GTATGCCATA AAGTTTCTAA AATTGCTTAG ATTTGATACG AAAAAAGATG 27 60 

AGGTTGGTTG GTCTCATCTT TTATAGGTCT TTATTTTACA AATGCATTGA TTTCTGCTTC 2 820 

GATGTTAGCA ATCTTAGCTT GTGATTCTTC GTTGGTTTCC CCTACAACTG CAATGTAGAA 2 880 

CTTGATTTTT GGTTCTGTAC CTGAAGGGCG AACGGCAATC CATGAACCGT CAGCAAGTGT 2 940 

GTATTTCAAC ACATCACTTG GAGGAGTTGT CAAGTTTGTA ACAGTACCGT CAGCAACAGT 3 000 

AGCAGTTTGT GCCTTGAAGT CTTCTACGAC AGTGATAGCT GTTGCGTTCC ATTCTGTTGG 3060

AGCATTGTTG CGGAATTTAG CCATAATCGC TTTGATTTGT TCAGCACCAT CGACACCTGA 312 0 

AAGAGTAACA GAGATTGTTT TTTCTGCGTA GTAGCCATAT TCTTTATAGA TTTCTTCGAT 3180 

ACCGTCAGCA AGTGTCAAAC CACGAGAACG GTAGTAGGCA GCAAGTTCAG CAACTACAAG 3240

AACGGCTTGG ATGGCATCTT TATCACGTAC AAATGGTTTA ATCAAGTAAC CGAAGCTTTC 3300 

TTCAAATCCC ATCATGTAAG TGTGGTTGTG TTTTTCTTCG AATTCTTGGA TTTTTTCAGC 3 3 60 

GATAAATTTG AAACCTGTCA AGACGTTGAA CATAGTTGCG CCGTAGCTTT CAGCAATCTT 3 42 0 

CGTTACCAAG TCAGTTGAAA CGATAGATTT GCAGAGAGCG GCATTTTCAG GAAGAGTTCC 3 4 80 

AGCGTTTTTG TGAGCTTCCA AGATGTATTT AGCCATGATA GCACCGATTT GGTTACCTGA 3 54 0 

AAGGTTGAGG TAGCTACCAT CTTTTTGAAG AACTTCAACA CCAACACGGT CAGCGTCTGG 3 600 

GTCAGTTGCG ACAAGAACAT CTGCACCAAC TTGACGACCA AGTTCTTCAG CAAGGGCAAA 3660 

GGCTGCTTGG CTTTCTGGGT TTGGAGATGT TACAGTTGAA AAGTCTGGGT CAGCAGTTGC 372 0 

TTGCGCTTCA ACAACTTGAA CAGAGTCAAA TCCTGCTTGG GCAAGAGCAC GACGAGCCAA 378 0 

CATTTCACCA GTACCATGAA GTGGTGTGTA GACAATCTTC ATGTCTTTAC CAAATTCTTC 3 84 0 

AATCAAGGCT GGGTTGATGT TTATGTCCTT AACCTCTTTA AGGTATTCTA TGTCAACAGC 3900 

TTCGCCGATA ACTTCAATCA AGCCAGAAGC TTTTTCAGTT TCCACATCAG CAACTTCAAC 39 60 

TGCAAATGGG TTTTCGATTG CACGGATATA AGTAGTCAAA GCGTCCGCAT CGTGTGGAGG 402 0 

CATTTGTCCA CCGTCTTCAC CGTAAACCTT GTAACCGTTA AATGGAGCAG GGTTGTGGCT 4080 

GGCTGTGACC ATGATACCTG CGAAACAGTT GAGATGACGA ACTGCAAATG ATAGTTCTGG 414 0 

AGTCGGACGA AGGCTTTCAA ATACGTAAGA TTTGATGCCG TGTTTAGCAA GAACTGCCGC 42 00 

AGATTCAAAG GCAAACTCAG GTGAGAAGTG ACGGCTATCG TAGGCAATTG CTACACCGCG 42 60 

TTCTTTCTCG TTTCCACCTT TTGACTCAAT CAAACGAGCC AATCCTTCAG TAGCTTGGCG 432 0 
AACAACGTAG ATGTTGATAC GGTTTGTACC AGCACCAACC AAGCCACGCA TACCTGCAGT 4380

ACCAAATTCA AGATTTGTAT AGAAGGCATC TTCCTTAGTT TTTTCGTCCA TATTTTCCAA 4440 

ATCTTGACGA AGGTAGTCAC GAAGCTCCAC AAAATCAACC CATTTCTGGT AATTTTCTTG 45 00 

GTAAGACATT CAAATTCTCC TTTATTTTTA AAACATTTAA TCAGTTTAAT TATATCATTT 4 5 60 

TTTTTAGTTT TAGTAAAACC TTATCTGCTT CGAACATCTC TTCAAACCAG GTCAGATTGA 4 620 

ATTTTGGGGT TATATGATGT TGAGGCTAGG AAAAATTCAA TTTCAGTAAA AAAAGTAAGT 4 6 80 

CTTCTCATAA CAAAACATTG ATATAGTTAC TTAGTTTTAA ACAAGCATAT TATAATAAAG 4740 

CTATGGCATA TAGTACTGAT TTTAAACAGC GAGCATTAGA TTACATCAAA GAGGGGCACA 4800 

GCCATGTCGA GGCAGCCAAG TTTTTTGGTG TTGGCGTCAG AACTCTCTTC ACGTGGGAAA 4 8 60 

AGAAAGACGT GAACAAGAAC ACATAGAGAG GAAAAAGCGA GTCGTCAAAA ACCGAAAGAT 4 92 0 

TCCTTTAGAG GAATTGAAAG CCTTTGTAGA GGCTCATCCA GATGCTTTTT TACGGGAAAT 4 9 80 

TGCGGCACAT TTTGATTGTG CTGTTCCTTC AGTATGGGCA GCTTTAAAGC AGATTAAGGT 5040 

CACTTTAAAA AAAGATGACG AGCTTTAAGG AACAAGACCC AGAAAAGTAG CCTTATTTCT 5100 

TAAGAATTTT AATAGTTTAA AGCACCTAGC ACCTGTTTAT ATTGATGAAA CAGGAATCGA 516 0 

CCGCTATCTC TATCGTCCTT ATGCAGGGGC TCCTAGAGGG GAGAAAGTCT ATGAAAAGAT 52 2 0 

TAGCGGACGT CGTTTTGAGC GAACTTCAAT TGTTGCAGGA CAAGTAGACG GAGAGTTTAT 5280 

AGCTCCCATG ATTTACAAGA AAAGCATGAC AAGCGATTTC TTTGTGGAGT GGTTCAAAAC 534 0 

GCAACTCCTA CCTGCTTTGA AGACACCTCA TGTTATTGTC ATGGGCAATG CTGGTTTTCA 54 00 

TCCCAAGAAC ATTTTGGATG AACTCTGCAT CCAAGATAAA CACTTTTTCT TACCTCTACC 54 60 

ACCTTATTCA CCGGATTTGA ATCCTATTGA GCAAGCTTGG GCTATCTTGA AAAAGAAAGT 552 0 

GACGGATGTA TTAAGGGAAG TTCCAACTAT TTTTGAATGT TTGGAATGCT TTTTTAAAAC 5580 

TAGATGACTA TAACGGTTCT AAAGGAACCT ATCGAGTAGT CATTAAAACT AAGGATACTG 564 0 

CTGGTTAAGA GAAGACGGTA TACAATCAAA CCATTCACCG TGTAGCCGAA ATCGTTCAGA 5700 

ATGAAGACTT GTATCAGAAT GAAGACTTGT ATAAGAAAGG TTTGAATGTT GAACTTGCGC 57 60 

ACCAACAAAT TAAGGGATTT TTTGAAGCAG AGTTTAAAAA TCGTATTAAT GGAGTTCTTA 582 0 

ATACTAAAAT AAAAAATAGT ACATTAAATC GTGTAAATAA AAAAACTATA CACCAGAGCA 5880 

ACAAAAACTC CATGATCAAT TTGAAGCAGA AGCAACGGAA GATGCTAAAA AACAAGGCGA 5940 

TATTGTGTTG AATGTTGACC AGGATTTCAT GAGCATATCT AAGTCTAATA AAAGTGGTTC 6000 

AGACTGGAAG AAAACTTTCA CAGTGAGGAT AACCAATAGG CTAGCAAATG ACTTGAATAA 6060 

TGTCTTGAAA CAGGTTGATA AAGATACTCC TAATACCCCA ACTTGGCTAA ACTCAGCTGC 612 0 
TTCTAAAGCT AAAGATGATG ACAGAGTATA TAAACTACTG AAGACTCTTA TACCAGGAGA 6180

AAATTACCTA TCATGTTAAG GATAATCAGC TAGAAGTAGA AACAGATAAA TACACATATA 6240 

CTGCCGCTAG AAATGGTAGT AAGGAAGTTG GTATTCAAGA GTCAGATATA GCAGCAACTC 6300 

TAAGTGCCGA TGAATATAAT TCTAATCGCC AAACTTTTGA GAGAGAATAC AAATACAAAA 6360 

GCAAATGCCC TTAATAATGG TTGGGCTAGA TCTGGTTCTG AAGAGTTCAA AAAGTTCTCC 642 0 

CACTTTGTAG GGGTAGACAA AGGGATTGTG CGAACGAATG TACTGACTGG TAAAAAACTA 648 0 

TCTGATAAGA TTAGGAAAGA AGTGGGCTCT GGAGATAGCA AACTAGGAAA AGGCGGCTAT 6540 

TTCTCTACTG GGGATGTTCT ATTAGGAAAA GATGTTGTTT CTTATACCGT ACAAGTATTT 6600 

TCAGAGAATA ATGAAAGAGT AGGAGTAAAC ACTCAAAGTC ACCGTGTTCA GTATAATCTC 6660 

CCAATTCTAG CTGACTTTTC AGTCATCCAA GATACTGTGG AACCATCACG AACCGTTGTT 6720 

GAAAAAATCA TTCCAAAACT AAATATTCCC GAAGAAGAGA AAGGGAAAAT AACCGAAGAA 6780 

ATCAAGAAAA AGAAAAAAAC CTCAGAATTG GCAGAACTAA TCTCAGAAAA TGTGAAAGTT 6840

CGCTATGTTG ATGAACAAGG GCGTTTGCTA TCATTGAAAA ATGATACTGG AATTGGAGAA 6 900 

AAAGAAAGTG ACGGAACCTA CATTACCAAT AAAAAACAAC TGATTGGTAC CAGCTATAAT 69 60 

GTCACAGATA AAAAACTCAG TAGCATGACT ACTACTGACG GAAAATATTA TACTTTTAAA 7 020 

GAAGCAGATA CAAATTCTGC AAGTTTAACT GGGAATATTG TAAGCGAAGG TAGAACAGTG 7 080 

ACCTTAGTTT ATAGAGAAAG CGAAGCGCCA ACCACTGCTA CAGTAACAGC CAATTACTAT 7140 

AAAGAAGGTA GGCAAGAGAA GTTGGTAGAG TCTGTTATAA AAGCTGATTT AGCGATAGGT 7200 

TCTGAGTATA CCACAGAATC AAAAACTATT GAAGGGAAAA CAACAACTGA GGACAAAGAA 7 2 60 

GACCGAGTTA TCACAAGGAA AACAACATAC ACCTTGGTAG CAACTCCTGA AAATGCGTAC 7 320 

CAGAAGACGG TGCAACAGTT GACTATTACT ACCGTGAGAA TGTTGAGGAA ACAGTGGTTC 73 80 

CCAAAACAGC AACCTCTACT GAGACGAAGA CTATAACGCG TATCATTCAT TACGTTGATA 7440 

AAGTTACGAA CCAAAATGTA AAAGAAGATG TTGTTCAACC TGTAACCTTA AGCCGTACAA 7 500 

AAACTGAGAA CAAGGTCACG GGAGTTGTAA CCTACGGTGA ATGGACAACA GGAAACTGGG 7 5 60 

ACGAGGTTAT ATCTGGTAAG ATTGACAAGT ACAAAGATCC AGATATTCCA ACAGTTGAAT 7 620 

CACAAGAAGT TACGTCAGAC TCTAGTGATA AAGAAATAAC GGTAAGGTAT GACCGTTTAT 7 680 

CAACACCAGA AAAACCAATC CCACAACCAA ATCCAGAGCA TCCAAGTGTT CCGACACCAA 7 740 

ACCCAGAACT ACCAAATCAA GAGACTCCAA CACCAGATAA ACCAACTCCA GAACCAGGTA 7 800 

CTCCAAAAAC TGAAACTCCA GTGAATCCAG ACCCAGAAGT TCCGACTTAT GAGACAGGTA 7860 
AGAGAGAGGA ATTGCCAAAC ACAGGTACAG AAGCTAATGC TACCTTGGCT AGTGCTGGTA 7920

TCATGACCTT GTTAGCTGGT CTAGGATTAG GATTTTTCAA GAAAAAAGAA GATGAAAAAT 7 980 

AATAGATTTT AGAATCTAGG AACCAGGAAA AGCTCACAGA TGTGGGCTTT TTTCCTGGTT 804 0 

TTGAGAACGA GGTCTTTCGT AAAGAATAAA AACGCTTACA AGTCTGTTGA ACTGGGAAAC 8100 

TATGAATCCT ATTTTTTTAA AAATATTTCC AGAAATCAGT TGCGG 8145
(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8697 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

CGGTACCGGG AACGATACTT AGTCTAATTT TGCACCTTTT CCATGTATGG TAAAGGTTTT 60 

TCTTTTTTTA AAAAGGAAAA CGAGAAGAGG AGGTTCTTAT GAAAGCAAGC ATTGCCTTGC 12 0 

AAGTTTTACC CCTAGTACAG GGGATTGATC GGATAGCTGT TATTGATCAG GTCATTGCTT 180 

ATCTGCAwAC TCAAGAAGTG ACGATGGTAG TGACACCATT TGAAACGGTC TTGGAAGGGG 240 

AGTTTGATGA GCTTATGCGC ATTCTAAAAG AAGCGCTGGA AGTGGCAGGG CAGGAGGCAG 300 

ACAATGTCTT TGCCAATGTC AAAATAAATG TAGGAGAGAT TTTAAGTATT GATGAGAAAC 360 

TTGAGAAGTA TACTGAGACG ACACATTAGT CTATTGGGCT TTCTCGGAGT ATTGTCAATC 420 

TGGCAGTTAG CAGGTTTTCT TAAACTTCTC CCCAAGTTTA TCCTGCCGAC ACCTCTTGAA 480 

ATTCTCCAGC CCTTTGTTCG TGACAGAGAA TTTCTCTGGC ACCATAGCTG GGCGACCTTG 54 0 

AGAGTGGCTT TACTGGGGCT GATTTTGGGA CTTTTGATTG CCTGTCTTAT GGCTGTGCTC 600

ATGGATAGTT TGACTTGGCT CAATGACCTG ATTTACCCTA TGATGGTGGT CATTCAGACC 660 

ATTCCGACCA TTGCCATAGC TCCTATCCTG GTCTTCTGGC TAGGTTATGG GATTTTGCCC 720

AAGATTGTCT TGATTATCTT AACGACAACC TTTCCCATCA TCGTTAGTAT TTTGGACGGT 780 

TTTAGGCATT GCGACAAGGA TATGCTGACC TTGTTTAGTC TGATGCGGGC CAAGCCTTGG 840 

CAAATCCTGT GGCATTTTAA AATCCCAGTT AGCCTGCCTT ACTTTTATGC AGGTCTGAGG 900 

GTCAGTGTCT CCTACGCCTT TATCACAACT GTGGTATCTG AGTGGTTGGG AGGTTTTGAA 960 

GGTCTTGGTG TTTATATGAT TCAGTCTAAA AAACTGTTTC AGTATGATAC CATGTTTGCC 1020 

ATTATTATTC TGGTGTCGAT TATCAGTCTT TTGGGTATGA AGCTGGTCGA TATCAGTGAA 1080 

AAATATGTGA TTAAATGGAA ACGTTCGTAG AATTAGAATG TTTCTGAAAA AGAAAAGAGG 1140 
AAATCAAAAT GAAGAAAACA TGGAAAGTGT TTTTAACGCT TGTAACAGCT CTTGTAGCTG 1200

TTGTGCTTGT GGCCTGTGGT CAAGGAACTG CTTCTAAAGA CAACAAAGAG GCAGAACTTA 12 60 

AGAAGGTTGA CTTTATCCTA GACTGGACAC CAAATACCAA CCACACAGGG CTTTATGTTG 13 2 0 

CCAAGGAAAA AGGTTATTTC AAAGAAGCTG GAGTGGATGT TGATTTGAAA TTGCCACCAG 1380 

AAGAAAGTTC TTCTGACTTG GTTATCAACG GAAAGGCACC ATTTGCAGTG TATTTCCAAG 144 0 

ACTACATGGC TAAGAAATTG GAAAAAGGAG CAGGAATCAC TGCCGTTGCA GCTATTGTTG 1500 

AACACAATAC ATCAGGAATC ATCTCTCGTA AATCTGATAA TGTAAGCAGT CCAAAAGACT 15 60 

TGGTTGGTAA GAAATATGGG ACATGGAATG ACCCAACTGA ACTTGCTATG TTGAAAACCT 162 0 

TGGTAGAATC TCAAGGTGGA GACTTTGAGA AGGTTGAAAA AGTACCAAAT AACGACTCAA 16 80 

ACTCAATCAC ACCGATTGCC AATGGCGTCT TTGATACTGC TTGGATTTAC TACGGTTGGG 174 0 

ATGGTATCCT TGCTAAATCT CAAGGTGTAG ATGCTAACTT CATGTACTTG AAAGACTATG 1800 

TCAAGGAGTT TGACTACTAT TCACCAGTTA TCATCGCAAA CAACGACTAT CTGAAAGATA 18 6 0 

ACAAAGAAGA AGCTCGCAAA GTCATCCAAG CCATCAAAAA AGGCTACCAA TATGCCATGG 192 0 

AACATCCAGA AGAAGCTGCA GATATTCTCA TCAAGAATGC ACCTGAACTC AAGGAAAAAC 19 80 

GTGACTTTGT CATCGAATCT CAAAAATACT TGTCAAAAGA ATACGCAAGC GACAAGGAAA 204 0 

AATGGGGTCA ATTTGACGCA GCTCGCTGGA ATGCTTTCTA CAAATGGGAT AAAGAAAATG 2100 

GTATCCTTAA AGAAGACTTG ACAGACAAAG GCTTCACCAA CGAATTTGTG AAATAATGAC 2160 

AGAAATTAGA CTAGAGCACG TCAGTTATGC CTATGGTCAG GAGAGGATTT TAGAGGATAT 222 0 

CAACCTACAG GTGACTTCAG GCGAAGTGGT TTCCATCCTA GGCCCAAGTG GTGTTGGAAA 22 80 

GACCACCCTC TTTAATCTAA TCGCTGGGAT TTTAGAAGTT CAGTCAGGGA GAATTGTCCT 2340 

TGATGGTGAA GAAAATCCCA AGGGGCGCGT GAGTTATATG TTGCAAAAGG ATCTGCTCTT 24 00 

GGAGCACAAG ACGGTGCTTG GAAATATCAT TCTGCCCCTC TTGATTCAAA AGGTGGATAA 24 60 

GGCAGAAGCT ATTTCCCGAG CGGATAAAAT TCTTGCGACC TTCCAGCTGA CAGCTGTAAG 2 52 0 

AGACAAGTAT CCTCATGAAC TTAGCGGTGG GATGCGCCAG CGTGTAGCCT TACTCCGGAC 25 80 

CTACCTTTTT GGGCACAAGC TCTTTCTCTT AGATGAGGCC TTTAGCGCCT TGGATGAGAT 2640 

GACAAAGATG GAACTCCACG CTTGGTATCT TGAGATTCAC AAGCAGTTGC AGCTAACAAC 2700 

CCTGATCATC ACGCATAGTA TTGAGGAGGC CCTCAATCTC AGCGACCGTA TCTATATCTT 27 60 

GAAAAATCGC CCTGGGCAGA TTGTTTCAGA AATTAAACTA GATTGGTCTG AAGATGAGGA 2 82 0 

CAAGGAAGTC CAAAAGATTG CCTACAAACG TCAAATTTTG GCGGAATTAG GCTTAGATAA 2880 
GTAGAAAAAT AGGGAGTTGG TGAAGATTAT CCTTTACCAG CGCCCTTTTT CTTTTAAAAA 2940

TGAGAAAATT TCGGTATAAT AGTCAAACAA GGTCAAGGTT TAAAGAGAGA GGTGGGTTTG 3000 

TTATGAGATT TAAAAATACA TCGGATCATA TTGAGGCCTA CATCAAGGCG ATTTTAGATC 3 0 60 

AATCTGGTAT CGTGGAGTTG CAACGGAGTC AGTTGGCAGA TACCTTTCAG GTTGTTCCTA 312 0 

GTCAGATTAA CTACGTGATC AAGACACGCT TTACGGAAAG TAGAGGCTAC TTGGTTGAAA 318 0 

GTAAGCGTGG TGGCGGAGGC TACATTCGTA TAGGACGGAT TGAGTTTTCT AGTCATCATG 324 0 

AAATGCTCCG GGAGCTGCTT TACTCGATTG GTGAGCGAGT CAGTCAAGAA ATTTATGAGG 3300 

ATATTCTCCA GCTTTTGGTT GAGCAGGAAT TGATGACCAA GCAGGAGATG AATTTGCTAG 33 6 0 

AATCAGTAGC TTTGGATCGC GTTTTAGGAG AAGAAGCTCC AGTTGTTCGA GCAAACATGC 342 0 

TACGTCAGAT CATACAAGAG GTAGATAGAA AAGGGAAGTA AGATGAACTA TTCAAAAGCA 3480 

TTGAATGAAT GTATCGAAAG TGCCTACATG GTTGCTGGAC ATTTTGGAGC TCGTTATCTA 354 0 

GAGTCGTGGC ACTTGTTGAT TGCCATGTCT AATCACAGTT ATAGTGTAGC AGGGGCAACT 3 600 

TTAAATGATT ATCCGTATGA GATGGACCGT TTAGAAGAGG TGGCTTTGGA ACTGACTGAA 3 660 

ACGGACTATA GCCAGGATGA AACCTTTACG GAATTGCCGT TCTCCCGTCG TTTGCAGGTT 372 0 

CTTTTTGATG AAGCAGAGTA TGTAGCGTCA GTGGTCCATG CTAAGGTACT AGGGACAGAG 378 0 

CACGTCCTCT ATGCGATTTT GCATGATAGC AATGCCTTGG CGACTCGTAT CTTGGAGAGG 3 84 0 

GCTGGTTTTT CTTATGAAGA CAAGAAAGAT CAGGTCAAGA TTGCTGCTCT TCGTCGAAAT 3900 

TTAGAAGAAC GGGCAGGCTG GACTCGTGAA GATCTCAAGG CTTTACGCCA ACGCCATCGT 3960 

ACAGTAGCTG ACAAGCAAAA TTCTATGGCC AATATGATGG GCATGCCGCA GACTCCTAGT 4020 

GGTGGTCTCG AGGATTATAC GCATGATTTG ACAGAGCAAG CGCGTTCTGG CAAGTTAGAA 4 080 

CCAGTCATCG GTCGGGACAA GGAAATCTCA CGTATGATTC AAATCTTGAG CCGGAAGACT 4140 

AAGAACAACC CTGTCTTGGT TGGGGATGCT GGTGTCGGGA AAACAGCTCT GGCGCTTGGT 4 200 

CTTGCCCAGC GTATTGCTAG TGGTGACGTG CCTGCGGAAA TGGCTAAGAT GCGCGTGTTA 4 2 60 

GAACTTGATT TGATGAATGT CGTTGCAGGG ACACGCTTCC GTGGTGACTT TGAAGAACGC 4320 

ATGAATAATA TCATCAAGGA TATTGAAGAA GATGGCCAAG TCATCCTCTT TATCGATGAA 4380 

CTCCACACCA TCATGGGTTC TGGTAGCGGG ATTGATTCGA CTCTGGATGC GGCCAATATC 4440 

TTGAAACCAG CCTTGGCGCG TGGAACTTTG AGAACGGTTG GTGCCACTAC TCAGGAAGAA 4 500 

TATCAAAAAC ATATCGAAAA AGATGCGGCA CTTTCTCGTC GTTTCGCTAA AGTGACGATT 4 560 

GAAGAACCAA GTGTGGCAGA TAGTATGACT ATTTTACAAG GTTTGAAGGC GACTTATGAG 4620 

AAACATCACC GTGTACAAAT CACAGATGAA GCGGTTGAAA CAGCGGTTAA GATGGCTCAT 4 680 
CGTTATTTAA CCAGTCGTCA CTTGCCAGAC TCTGCTATCG ATCTCTTGGA TGAGGCGGCA 4740

GCAACAGTGC AAAATAAGGC AAAGCATGTA AAAGCAGACG ATTCAGATTT GAGTCCAGCT 4800 

GACAAGGCCC TGATGGATGG CAAGTGGAAA CAGGCAGCCC AGCTAATCGC AAAAGAAGAG 4 860 

GAAGTACCTG TCTACAAAGA CTTGGTGACA GAGTCTGATA TTTTGACCAC CTTGAGTCGC 4 920 

TTGTCAGGAA TCCCAGTTCA AAAACTGACT CAAACGGATG CTAAGAAGTA TTTAAATCTT 4 980 

GAAGCAGAAC TCCATAAACG GGTTATCGGT CAAGATCAAG CTGTTTCAAG CATTAGCCGT 504 0 

GCCATTCGCC GCAACCAGTC AGGGATTCGC AGTCATAAGC GTCCGATTGG TTCCTTTATG 5100 

TTCCTAGGGC CTACAGGTGT CGGGAAAACT GAATTAGCCA AGGCTCTGGC AGAAGTTCTT 516 0 

TTTGACGACG AATCAGCCCT TATCCGCTTT GATATGAGTG AGTATATGGA GAAATTTGCA 522 0 

GCTAGTCGTC TCAACGGAGC TCCTCCAGGC TATGTAGGAT ATGAAGAAGG TGGGGAGTTG 5280 

ACAGAGAAGG TTCGCAATAA ACCCTATTCC GTTCTCCTCT TTGATGAGGT AGAGAAGGCC 53 4 0 

CACCCAGATA TCTTTAATGT TCTCTTGCAG GTTCTGGATG ACGGTGTCTT GACAGATAGC 5400

AAGGGACGCA AGGTCGATTT TTCAAATACC ATTATCATTA TGACATCGAA TCTAGGTGCG 54 6 0 

ACTGCCCTTC GTGATGATAA GACTGTTGGT TTTGGGGCTA AGGATATTCG TTTTGACCAG 552 0 

GAAAATATGG AAAAACGCAT GTTTGAAGAA CTGAAAAAAG CTTATAGACC GGAATTCATC 5580 

AACCGTATTG ATGAGAAGGT GGTCTTCCAT AGCCTATCTA GTGATCATAT GCAGGAAGTG 5640 

GTGAAGATTA TGGTCAAGCC TTTAGTGGCA AGTTTGACTG AAAAAGGCAT TGACTTGAAA 5700 

TTACAAGCTT CAGCTCTGAA ATTGTTAGCA AATCAAGGAT ATGACCCAGA GATGGGAGCT 5760 

CGCCCACTTC GCAGAACCCT GCAAACAGAA GTGGAGGACA AGTTGGCAGA ACTTCTTCTC 5820 

AAGGGAGATT TAGTGGCAGG CAGCACACTT AAGATTGGTG TCAAAGCAGG CCAGTTAAAA 58 80 

TTTGATATTG CATAAAAGAA TAAAAGTATC AGCATCTGAC CATAAGTCAC AGTGGAGTGA 594 0 

AATTCAATGA AAATCAAAGA GCAAACTAGG CAGCTAGCCG CAGGTTGCTC AAAACACTGG 6000

TTTGAGGTTG CAGATAGAGC TGACGTGGTT TGAAGAGATT TTCGAAGAGT ATGAAACTAA 6060 

AACCTATAGC TTCTAAACGA TCCGTGGTTT TCATCATTCA ACACAAAATT CATATGTTTA 6120 

TTACCCTCCG TCGTATTTGT CTTAGAGCGT GTGTAGTAGA AAAAGAGCAG TCTTATCTGA 6180

AATTTTTATT CTTTCAAAAG AGACCTGTTT CTTTTTTGCA TGTCAAATCC GTTCTAGCTG 6240 

GTATTTGAAA AATCAAACTA ATATTCAATG AAAATCAAAG AACAAACTAG GAAGCTAGCC 6300 

GCAGGTTGCT CAAAACACTG TTTTGAGGTT GTAGATAGAG CTGACGTGGT TTGAAGAGAT 63 60 

TTTCGAAGAG TATAAGCTGC AAGATGAATG ATTTTCTTGT ATTGACGTTG TTGTTGACAA 6420 
TCAAATAGCA TACAGGAAAT AGATGTATGG AGTTCTGGTA GTAGAAAGGG AGAGAGATGA 6540

ACATTTTAGT TGCAGATGAC GAGGAAATGA TTAGAGAAGG AATTGCAGCA TTTCTGACAG 6600 

AAGAGGGTTA TCATGTCATT ATGGCTAAGG ATGGACAAGA GGTCTTGGAA AAATTTCAAG 6 6 60 

ATCTCCCTAT CCATCTCATG GTACTGGATT TAATGATGCC TAGGAAGAGT GGTTTTGAAG 672 0 

TGTTAAAAGA AATCAATCAA AAGCACGATA TTCCTGTCAT CGTCTTGAGT GCTCTGGGAG 67 8 0 

ATGAAACTAC TCAGTCACAG GTATTTGATC TCTATGCTGA TGATCATGTG ACAAAACCTT 6840 

TTTCTTTGGT ACTGCTTGTC AAGCGTATTA AGGCGCTTAT CAGACGTTAC TACGTCATAG 6900 

AGGATCTTTG GCGATATCAG GATGTAACAG TGGATTTTAC CTCTTACAAA GCACATTATA 69 60 

AAAATGAAGA AATTGATCTC AAACCAAAGG AATTACTGGT ACTAAAGTGT TTGATTCAGC 702 0 

ATAAAAATCA AGTTTTAAGT AGAGAGCAGA TATTGGAAGA AATTTCAAAA GATGTAGCTG 7080 

ATTTACCTTG TGATAGGGTC GTTGATGTCT ATATTCGTAC TCTTCGCAAA AAATTAGCTT 7140 

TAGATTGTAT CGTGACTGTG AAAAATGTTG GGTATAAGAT TAGCTTATGA TAAAAAATCC 72 00 

TAAATTATTA ACCAAGTCTT TTTTAAGAAG TTTTGCAATT CTAGGTGGTG TTGGTCTAGT 72 60 

CATTCATATA GCTATTTATT TGACCTTTCC TTTTTATTAT ATTCAACTGG AGGGGGAAAA 732 0 

GTTTAATGAG AGCGCAAGAG TGTTTACGGA GTATTTAAAG ACTAAGACAT CTGATGAAAT 7380 

TCCAAGCTTA CTCCAGTCTT ATTCAAAGTC CTTGACCATA TCTGCTCACC TTAAAAGAGA 7440 

TATTGTAGAT AAGCGGCTCC CTCTTGTGCA TGACTTGGAT ATTAAAGATG GAAAGCTATC 7500 

AAATTATATC GTGATGTTAG ATATGTCTGT TAGTACAGCA GATGGTAAAC AGGTAACCGT 7560 

GCAATTTGTT CACGGGGTGG ATGTCTACAA AGAAGCAAAG AATATTTTGC TTTTGTATCT 762 0 

CCCATATACA TTTTTGGTTA CAATTGCTTT TTCCTTTGTT TTTTCTTATT TTTATACTAA 7680 

ACGCTTGCTC AATCCTCTTT TTTACATTTC AGAAGTGACT AGTAAAATGC AAGATTTGGA 7740 

TGACAATATT CGTTTTGATG AAAGTAGGAA AGATGAAGTT GGTGAAGTTG GAAAACAGAT 7800 

TAATGGTATG TATGAGCACT TGTTGAAGGT TATTTATGAG TTGGAAAGTC GTAATGAGCA 786 0 

AATTGTAAAA TTGCAAAATC AAAAGGTTTC CTTTGTCCGC GGAGCATCAC ATGAGTTGAA 7920 

AACCCCTTTA GCCAGTCTTA GAATTATCCT AGAGAATATG CAGCATAATA TTGGAGATTA 7980 

CAAAGATCAT CCAAAATATA TTGCAAAGAG TATAAATAAG ATTGACCAGA TGAGCCACTT 8040 

ATTAGAAGAA GTACTGGAGT CTTCTAAATT CCAAGAGTGG ACAGAGTGTC GTGAGACCTT 8100 

GACTGTTAAG CCAGTTTTAG TAGATATTTT ATCACGTTAT CAAGAATTAG CTCATTCAAT 8160 

AGGTGTTACA ATTGAAAATC AATTGACAGA TGCTACCAGG GTCGTCATGA GTCTTAGGGC 8220 



ATTGGATAAG GTTTTGACAA ACCTGATTAG TAATGCAATT AAATATTCAG ATAAAAATGG 82 80 

GCGTGTAATC ATATCCGAGC AAGATGGCTA TCTCTCTATC AAAAATACAT GTGCGCCTCT 8340 

AAGTGACCAA GAACTAGAAC ATTTATTTGA TATATTCTAT CATTCTCAAA TCGTGACAGA 8400 

TAAGGATGAA AGTTCCGGTT TGGGTCTTTA CATTGTGAAT AATATTTTAG AAAGCTATCA 8460 

AATGGATTAT AGTTTTCTCC CTTATGAACA CGGTATGGAA TTTAAGATTA GCTTGTAGAC 852 0 

AGATTAGTTT TTTATTAAAG TTCATATAGG GTTAACATAA GTGTGTTATT CTTTGTGTAG 8580 

ATAAAAGAAA GGATACTAAT ATGGTATTAG CGATTATTTT AGTAACATTC TTTATTCGAT 8640 

TGATTTTTTT AAAGCGTTCG ATAGAGAATG AGAAACGAAT CCTTAGCAAT GGCGGGG 8697 
(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4317 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124:

AACCATACAT ACGGCAAGGC AAAGCTGACG CGGTTTGAAG AGATTTTCGA AGAGTATTAG 60 

TTGCCTTTAA AGGCATCCAC CATCGTTTGA AATTCTTCAT TTGAGAGAGT AATCCCTTTG 120

CCCATTTTAG TATGGTCTGG ACTCCAAGCA CGAATATCAA ACTTTGCAGG GGCACCATTA 180 

AAGCTCACAC GGTTAATTTC CTTGGTCCAA CCTTTTTCGT TTTCAGAAAG AGTCAACAAG 240 

TGCTCTTCGA TTTCAAATGT AAATTCTGCC ATTTTCTTCT CCTTTTTTAG TTTCATTAGT 300 

TTATTCGTAA AATCTTGTAG ATTTTAGGAA AATTTTATAT AATATTGATA TAAAAGAAGG 360 

GAGGCCAATA TGAGACATAA ATTCCAGCAA GTTCTAAATA AAATACATGA TTTTTTAAAT 420 

GGATATGACC AACCTGACCA GACTGAAACC AACTCCCTTA CAGCCACTAT TGAAGAGGCT 480 

ATCCAGAAAC AAACCGCTGT TCACCTTATC TTGTCTGAGA CAAGCTTTAC AGGTGACATC 540 

ATCAAATATG ATCAGCAAGG CCAGCAAATT ATCGTGAAAA ATTTTTCCAA AAATGTGAGC 600 

CGGATTATCC GTATAAGCGA TATTCAACGC CTGCGATTTG TCCCCTCAAC TGTCCAAACA 660 

GCCCAAAAAA ATAGATTTAA GAAAGAGTGA GATGTAGTTG CTTCATCCCA CTCTTTTTTC 720 

TTAGCGAATT TGTTCAAAAT GTAAATGAAC TGCGATATGA TCTCCATAAC CACTTCTTTC 780 

CAAGTCACGT TGTAAACGAT AGGAAATGTA GTGTTCTGCA ATGGTAATGT AACCTGCGCC 840 

CAATAAACGA TGTTCAACCA TAGATTGAAT CATACTGATA GTCGCACGTT CCACCTTGGC 900 
TTCTTGTAAA TCCAAAACTA CCTTCTTAGT GACTTGAGCA AGATTTTGAC GCAAATCATC 960

TGTCAAAACA TAAACAGTTT GGGCTGCCTT CAAGATGGCT TGGTAAATCT TATCTGGATT 1020

AAATTCAGCA ATTTCGCCAT TACGTTTGAT TACTTGCATA GGTTTCTCCT TTATTCTTTG 1080 

TTTTCTTTGA TTTCTGCCAG CATTTTTTCT TCTTCTACTG TCAGTTGATA ATGTTCAAGT 1140 

AAATCCGGTC TGCGCTCGTA GGTTTTCTTT AAACTCTCGT ACAATCGCCA CTGACGAATC 1200

TTTTCATGGT GGCCACTCAT CAATACATCT GGCACGACCA TGCCTCGATA ATCATAGGGA 1260

CGTGTGTACT GAGGATATTC TAAAAGACCT GAAGAAAAAC TATCATCTTG GTGGCTAGAC 1320

TCCTTGCCAA TCACTTCTGG AATCAGGCGA ACTGTAGCAT CAATCATGGT CATAGCTGCC 1380 

AATTCTCCAC CAGTGAGGAC ATAGTCACCT AGGGAAATCT CATCTGTTAC CAAGGTCTTA 1440

ATGCGCTCAT CATAACCCTC ATAGTGCCCA CAGATAAAGA TTAGCTCTTC CTCTTGAGCC 1500

AAATCTTCAG CATAAGCCTG ATCAAACTGC TTTCCAGCAG GATCAAGGAG AATAACGCGC 1560

GGATTTTTCT TTTCAATAGC ATCAAAGGAA TCGAAAATAG GTTGTGCTCT GAGCAACATG 1620

CCCTGACCGC CTCCGTAGGG CTCATCATCT ACATGACGGG CCTTTTCAGC ATTTTCTCGA 1680 

AAATTATGAT ACTGGATATC CAAGAGCCCT TTTTCTCGAG CCTTTCCAAC GATTGAGTGC 1740

TCCAGTGGAG AAAACATCTC TGGAAAGAGG GTTAAAATAT CAATCTTCAT CGTCTAACCC 1800

TTCTAAGATT TCCACATCGA CCCGTTTACT TGGAATATCA ACATTGAGAA CCACTGGTGG 1860 

GATATAAGGT AAAAGCAAAT CACGTTTGCC TTTTCGTTTG ACCACCCAGA CATCATTAGC 1920 

ACCTGGTTGC AGGATTTCCT TGATGGTTCC AACCAAGCTA TCACCCTCAT AGACTTCCAA 1980 

ACCGATAATC TCGTGATAGT AAAATTCACC ATCGTCTAGG TCATTCAAAT CTTCCTCAGC 2040

GACCTTGAGA CTGTATCCCT TGTACTTTTC GATAGTATTG ATATGGTACA TATCTTTGAA 2100 

TTTAATAATG TCAAAGTTCT TCTGTTTACG GTGGCTAGCG ATGGTCACTG TTTGGACAAA 2160 

CTGATCTTTT TCATCAAACA AAACCAGCTC AGCTCCTTTT TTAAACCGTT CTTCTGCAAA 2220

ATCCGTCACA GACAAGACTC GCATCTCCCC CTGTAATCCC TGCGTATTAA CGATTTTCCC 2280 

AACATTAAAG TAGTTCATCT TGTCTCCTGT AATCTCCTTT TTTCCATCTT ATTCTAACAA 2340

TTCTCGAATA ATAGCCGCAA TTTTTTCCGA TTCTGACCAT TGTAAATAAT GGTGATTCCC 2400

TCCTAAAATG AGTTTAGTAT TGGAAGTCCA ATATTCTGAT TCTCTGTACT CTTTTTCTCT 2460

ATAAGGCTGA CAAAAAACAA ATACAGGAAT ATGAGCTTCT ATAGATACAT CCTCAAAATC 2520 

TTCCTCAGTA ATCTCTCCAG ATATCTGAAA TTCTGGATCT TGATTTTCCA ACTCTAAGCC 2580 

TTTTTCTTGC ATTAATTCCC AGATTTTTTT ATTCGTTTCA GGACTAAATG TTGCTTGAGT 2640

TAAGTTCTTA AAATAAAGTT CAGGACCACA CTCGTCAATC AGCCTCATCT GCTCTTCCAT 2700
TTCTGGATAA GGATTTTCTG AAAAATCAGC AAACATGACT TTTTTAGTTG TCGGTTCAAT 2760

TGCTACTAAA GTCTGACGCT TAATTGGTTT CTCGAGTAAT TTGCAAGCTA AAATTCCACT 2820

CCAACTATGT GCACAAAGTA TATATTCAGA AATTCCTAAT TCTTCAAGTA CTTCATAAAC 2880 

CGCATCTGCA AGATTATCTA GATTTTTTCC AGCTTGGTCA TGAATCGGAC TCCTACCTGT 2940

GTTCGGAAAA TCAATTGTCA AATAACCAAT TGTAGGAGGA GGTTTTTCAA GTATAAGTGA 3000 

AAAATTTTCA TAACTTGGTA GCAAACCTGC TCCGTTTAAA CAAACTAGCA CTTTCTTTTG 3060

CTTTTGATAA GTAACAGAGA GGCTACCAAT TTCTGTAGAT ACTTCAAACC TCTTCATAAA 3120 

GAAATCCACT GATTCTATAT AATGAATTAT TAAAAATCCT TATCCTTTAT TTTATCACGT 3180 

TCCAAGGATT TTCTCAAGTT GGAGGAAGGG GACAATATCT CTACTTTCCC TTCAATAATC 3240 

CTTCCAAATT ATGTTTATGT TGGTAATTAA TGGCTGCGGT TTTGTCTTTC TCAAAGACAG 3300

TCTTGGTAAG GTCAATATGA TTAATAGCTA CGATTGCGAC GGTGTAGTAA ATGATATCAG 3360

CCAGTTCTCT GGCAAGTTCC TCGTTCGAAT CCTATCCCTT CTTTTCGACC AGAGCGCCTA 3420

TTCAAAACCT CGACTACTTC TCCGACTTCC TCCACTAACT TCATAAAGAG ACCTTCATCA 3480 

GTCCGAGACT GCTGTTAATG TTCGATTAAG TAGTCTTGGA ATTGCCTAAA CGTTCAATCT 3540

TTTATAGTAT ATTGAAACTA GAATAGTACA CCTTTACTTC TAAAACATTG TTAGAAATCG 3600 

ATTTGACTGT CCTGATCGAT TTGTCCTGTT CTTGTTTCAT TTTACTATAT CTTCTATTCC 3660 

ACACAAAAAA GCGAGACATC CGTCCCGCCC TTCTTATTTT TCGTCAATAA CGATTCTTAC 3720 

TTTTTTGTAT TCAGTTGGGA CAGAGTAGAC AATCGTTCTT ATCGCAGAAA TAGTGCGACC 3780 

CTTACGACCG ATTACACGAC CCACATCGCT TTGATCAAGA TTCAAATGAT ATTCCAAAAA 3840 

TTCTGGTGTA TCCTCAATCT TGATAGTTAA GGCATCTGGT TGTGAAATTA AGGGTTTCAC 3900

AATCGCAATA ATGAGATTTT CAATCGTATC CATCTGTCAA CCTACTTTAA ACTTATTTTG 3960

AAAATTTAGA ATCGTGGAAT TTTTTCAATA CGCCTTCTTT TGAAAGGATG TTACGTACTG 4020

TGTCTGAAGG TTGAGCTCCA TTAGCCAACC ATGCAAGAAC GCGGTCTTCT TTCAAAGTTA 4080

CTTGGTTTTC AGCAACAAGT GGGTTGTAAG TTCCAACTGT TTCGATGAAA CGTCCGTCAC 4140 

GTGGTGAACG TGAATCTGCT ACGTTGATAC GGTAGAAAGG TTTTTTCTTA GAACCCATAC 4200 

GAGTCAAACG GATTTTAACT GCCATTTTTA AAGTCTCATT TCTTTAATTT TTTATTTCGG 4260 

TGAAATAGCT GAGCTATTTA GCACATGTTC TATTATAGCA GATTTCTGGC ATGTGTC 4317 

(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4881 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 
AATTTATTTG ACTGGAAATT GTAGAGGGTT CTCGAAATTT CTTGAATGGT TAAAATAAGG 60 
ACAAGAGAAA ACATGGATAT CTATATCCTT GTGCCAAAAA AACCACTGCC CTCCCCAGAC 120 
CAACCTGAGG AAAGCAGTGA TTCTTATTTT AGGAGTTAGG AATGAATACA CGAAATCAAT 180 
TTAGCTGATT ATTTTTTGTT TTTCAAGAAT TCATCGTATT GTTTTTGCAT TTCGTTCAAT 240

ACTTTTTCGT AGGCACCTTC AGATTTCAAT TTTTCCATCA ATTCTGGAAT CGCTTTATCT 300 
GGGTCTACAG TACCAGTGTT GATAGCTGTA TCAAATTGTT GCATTGTGTT AGCAATAGCT 360

GAGATTTCAG ATTTCACATT GTCAGTATTG AAGATAAATC CAAGCGCTGG AGATTCTTTA 420 
GCTTCTGCCA ATTCTTTCTT AGAATTTTCG ATTTGTTGGT CTGTAACGTT TTCGTTGATG 480 
TAAAGGATCC AGTTGTTACC AGTGTTCCAT CCACCCATGT GAGTGTTTCC TTTGTAGCCA 540 
TCAAGAACGC GAACACGGTT TTCTTTACCT TCAATTTTTT CCCAGTTCTT GCCTTCTGGA 600

CCGTAAACAA GACCGTTCAA GAGTTCTGGG TTCGTATTCA AGAGGTTCAA GATTTCCATT 660

GATTTTTCTT TGTTCTTAGA GTTGTTTGAG ATGACAAAGT TAGCAACTTG TGTTGTTTGG 720 

TTTTTCTTGA TGAAGTTAGT AATTGGTTTG ATTTGGATAT CTTTGTTGGC AACACGTGAA 780 

AGCAAGCTGT TACCGTAGTC AGCTGGTCCT ACTGTTTCTT CACGAACGAA CCAAGTATCT 840

TGTTGAAGGT CAAAGGAAGT ATCGCTTGTT GCGACGTCTT TTGGAATGTA GCCAGCTTCA 900

TAGAATTTGT GAAGAGTCTT CAAGTGTTCT TTGAAACGAG GCACTTCGTA ACGGTTTACA 960

ACTTTAGTAG TATCGCCTTC AAGGTCGATA ACGAATGGAA GACCGTTTGC TACTGGGTAG 1020

TCAAAATTAT CAGATGGGAT GAAAACTTTA CCAATAGCAA ATGGTACTAC GTCTGGAGCT 1080 

TTTTCTTTGA TTTGTTTCAA GACTGGCTCA AGAGTTTCGT AAGAAGTAAC ACCTGAAATA 1140 

TCGATACCAT ATTTAGCAAG GAGAGTTCCG TTGAAGGCAA AGTTTTGAGA TGATGCAACG 1200

TTGGCTGCAA CTGGAACAGC GTAAATCTTA CCATTTACAG TATTACCCTT GATGTAAGCT 1260

GGGTCAAGTG CTTTGTAAAG GTCTTTACCT TCTTTTTTGT ACAATTCTGT CAAGTCAGCG 1320

TAAGCACCTT TTTGAGCATT TACAATATAG TTATCTGCAA AGGCAATATC ATAGTTTTCA 1380

CCAGATGATG TGATAACTGA CATTTTCTTA CCATAGTCAC CCCAGCCAAG GTATTGGATA 1440

TCCAATTTGG CACCAACTTT TTCTTCAATG ATTTTGTTGG CATTTGCTAA CAATTCATCC 1500 

AAGTTGTCTG GTTTGTCACC GATTTGGTAC ATTTTGATAA CAGGTTTGTC ACCTGAATCA 1560
GCAGCTTTTT TGCTGTTACC TGTCAAATTT CCACAAGCAG CAAGACCTGC AGCCAGAGCG 1620

ACTACACTAG CAGATGCAAA AGCATATTTT TTCCAGTTTT TCATGATAAA AACTCCTTTT 1680 

TTTATTTTTA AACTTATAAA CAATGTAATG ATCTTATACT CAATAAAAAT CAAAGAGCAA 1740 

ACTAGAAAAC TAGCCGCAGG CTGCTCAAAG CACTGCTTTG AGGTTGTAGA TAAGACTGAC 1800 

GAAGTCAGTT ACATATATCT ACGGCAAGGC GACGTTGACG CGGTTTGAAT TTGATTTTCG 1860 

AAGAGTATTA ACTTCACACA AGGGAAGTTG GGAACTGAGA AATGTTATTT CTCAATAAGC 1920

ACTATTCTTT CACACCACCG ATAGTCAAAC CTTTTACAAA GTAGCGTTGG AAAAATGGAT 1980 

ACAAAATCGC GATTGGAAGG GTTGCAACCA CAACCATGGC CATACGACCT GTTTCTTTCG 2040

GTAGAGCAAC TCCCAGTTGA CCAATCAAGC CGACCGCTTT GGCAATGTAG TCCATATTTT 2100 

GTTGGATTTG CATGAGCAAA TATTGCAATG GATACAAGTT GTCACTCTTG ATGTAAAGAA 2160 

GGGCGTTGAA CCAGTCATTC CAGAAACCAA GAGCTGTTAA GAGCGTGATG GTTGCGATAC 2220

CTGGTAGTGA CAATGGCAAA CAGATTTGGA AGAAAATCCG GGCCTCACTG GCACCATCGA 2280

TACGAGCCGA TTCTAGAATG GCTTCTGGAA TGGTCTTCTT GAAGAAGGAA CGCATCAAGA 2340

TGATGTTAAA TGGTGAGAGA AGCATTGGAA CAATCAAGGC CCAAACAGTG TCACCAAGCT 2400 

GAAGTACACG GGTCACCATG ATATAACCTG GTACCAAACC AGCGTTGAAC AACATACTGA 2460

GAAGGACGAA GATGGTAAAG AATCTGCGAT ACTTAAAGGT TGTCCGTGAA ATAGCGTAGG 2520 

CATAGGTTGT TGTGATAAAG ACATTTGTCA ATGTCCCAAC TACGGTTACA AAGACAGAGA 2580 

TGAAGAGGGC TTGTAGGATT TTATCCTTAA ACTGTGCCAA AAACTCAAAA CCGTCTAAGC 2640

CAAATTGGGA TGGGAAGAAG CTATAGCCGT ATTGGAGGAG GCTTTTCTCG TCTGTCACTG 2700 

AAATAATGAT AACGAATACA AAAGGTAGGA TACAAGAGAG GGCAATCAAA CCCGAAATGA 2760

TACTGAAGAA GATATCTGCT TTCTTACTGA AGGAGTGAAT GCCGACATTA TCAATTTTTT 2820

CTTTTTTAAT TTTCTTTTTT GCCATATTCT CCTCCTTTCT AGAACAAAGC TGAGTTTGGA 2880 

TCGACTCGTC TTGCAAGCAA GTTTGATAGG ATAACCAGAA TCAAACCAAC AACGGATTGG 2940

TAAAGACCGG CTGCTGCAGC CATACCGATA TCTGCTGTCT GAGTCAAACC ATTAAAGACA 3000 

TATACGTCCA AAACGTTGGT TACATTGTAA AGCTGACCAG CATTGTGTGG GATTTGATAG 3060 

AAGAGACCGA AGTCTGCGCG GAAGATATTT CCGACTGCAA GGATGGTCAA TACAGTTACA 3120 

AGCGGAGTCA ACTGAGGAAT GGTTACGTTG CGAATACGTT GCCACTTGCT AGCTCCGTCC 3180 

ACTGTCGCTG CTTCGTAGTA GGTTGGATCA ATTCCCATGA TCGTCGCATA GTACATGACA 3240

CTGCTATATC CAAAGCCTTT CCAAATACCT AGGAAAAGTA GGAGATAGGG CCAGATGCCC 3300 
AGGTCAGCGT AGAAATTGAC TTCTTTGAGA CCAAGACTTT CCAATAGATG ATTGAACACC 3360

CCTTTATCAA TATTTAGGAA GGCATCTGTA AAGAAACTGA TGATAACCCA AGACAAGAAG 3420 

TAAGGGAACA ACATAGAAGT TTGAAAAATC TTCACCATTC TCTTAGAACG GAGCTCGCTG 3480

AGGATAATGG CAATCCCTAC AGATACAACT AAACCTAGAA AGATAAAGCC AAGATTGTAG 3540 

AGGACAGTAT TTCGTGTGAT AATAAAGGCG TCTCTTGAAC TAAATAAGAA TCTAAAATTA 3600 

TCGAGTCCGA CCCATTTACT ATTTATGATA CTATCTATGA AACCATTACT GGTCATGTGG 3660

TAGTCTTTGA AGGCAACCAC GTTCCCAAAT ACTGGAATGT AAAAGAATAG AATCAACCAG 3720 

AGTGCCCCTG GCAAAACCAT CAAGAGAAAG ATCCAGTTGT CTCTCAATGT TTTTGAAAAC 3780 

TTTTTCATAA TTTCCTCCCT TTTTATTTTG ATATCCATCT AAAAATTCTT TTTTAGACTT 3840 

TTGATAACGA TTACATTATT AGTATACTCC TATTTGCAGG TTAGGTTAAA CTCCTAATTA 3900 

TAGAAAAAAC TCCACAAATT ATGTAGCAGA TTTAAAACTT TATCACCACT ATCAAACAAA 3960

TGTCCTAAAT CAATTGTTTA TTTTATCTCT ATTAGCCCAG TGATGGCGTC ACTCTGTTAT 4020

AAGCATCCAA CAACGGGGTA TACTGAAAAA TCTCCAGACT AGGGAACTCA GCGATAGTTC 4080 

CTAATCTGGA GATTTTTAAT ATGTTATTAG GCGTTTGCTT TCAACTTAGC AATAACCTCT 4140 

TTAAGATTAT CAATCAACTC TGCTGCAGTA TGCTCAGAGC CTTTTTCATC TGCCAAGAAC 4200 

AAAACTGCTT TTTGAAGTTC TTTTTGAGAG TTTTCAAGGA CATCCTTATC TACTGTTTCA 4260

AGGTTTGAGT CTTTAAGAAG TTTACTTAAT TCCTTGGCTA ATTTCTTGAG TTTGATTTGC 4320 

AGACTCATCT TCTCCTGCTG TTTCTTTGCC CGCTGTTTGT CCTCCATCCT TAGTTGCTGA 4380 

CTGGCTTTCC TTAATGGACT CTAGGGAAGC AATGGCATCT TTGACTGTTT GCAAGATATC 4440 

ACGTAAACCT TGCTCTGTCA AACTATCATC TGCAAAAGCT TTATTAGCCT CTGCCAAAAC 4500

CAGACGTGCT GAATCTGTGG TAGGATTCGA TACACCTGTC AATGATCTCA AAAGATTTTC 4560

TAAGGTTTGA GTCTGCTTAC TAATACTAGA CTAAAATCAA AAAGTATTAT ATAACAGTGA 4620

TATGAAATCA ACTAAAGAAG AAATCCAAAC CATCAAAACA CTTTTAAAAG ACTCTCGTAC 4680

AGCTAAATAT CATAAACGCC TTCAAATCGT TCTATTTTGT CTGATGGGCA AATCTTATAA 4740 

AGAGATTATA GAACTTTTAT AGTAGTTTGA AATAAGATGT GAACATCTCT ATCAGGAAAG 4800

TCAAATTAAT TTATAGAAAT ATTTTAGCAG CCAAGGTGTA CTGTTATAGA TTCAATACAC 4860

TATACTTGGT GGTTTAGCTC G 4881 
(2) INFORMATION FOR SEQ ID NO: 126:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13121 base pairs 

(B) TYPE: nucleic acid




(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 

AGGATCCCCG GAAAAGGAGA CTAAAAATGA AGAAAAAATT TCTAGCATTT TTGCTAATTT 60 

TATTCCCAAT TTTCTCATTA GGTATTGCCA AAGCAGAAAC GATTAAGATT GTTTCTGATA 120

CCGCCTATGC ACCTTTTGAG TTTAAAGATT CAGATCAAAC TTATAAAGGA ATTGATGTTG 180 

ACATTATTAA CAAAGTCGCT GAGATTAAAG GCTGGAACAT TCAGATGTCC TATCCTGGAT 240 

TTGACGCAGC AGTCAATGCG GTTCAAGCTG GGCAAGCCGA CGCTATCATG GCAGGGATGA 300

CAAAGACTAA AGAACGTGAA AAAGTCTTCA CCATGTCTGA TACTTACTAT GATACAAAAG 360 

TTGTCATTGC TACTACAAAG TCACACAAAA TTAGCAAGTA CGACCAATTA ACTGGCAAAA 420

CCGTTGGTGT TAAAAACGGA ACTGCCGCTC AACGTTTCCT TGAAACAATC AAAGATAAAT 480 

ACGGCTTTAC TATTAAAACA TTTGACACTG GTGATTTAAT GAACAACAGC TTGAGTGCTG 540

GTGCCATCGA TGCCATGATG GATGACAAAC CTGTTATCGA ATATGCCATT AACCAAGGTC 600 

AAGACCTCCA TATTGAAATG GATGGTGAAG CTGTAGGAAG TTTTGCTTTC GGTGTGAAAA 660 

AAGGAAGTAA ATACGAGCAC CTGGTTACTG AATTTAACCA AGCCTTGTCT GAAATGAAAA 720

AAGATGGTAG TCTTGATAAA ATTATCAAGA AATGGACTGC TTCATCATCT TCAGCAGTGC 780 

CAACTACAAC TACTCTCGCA GGATTAAAAG CTATTCCTGT TAAGGCTAAA TATATCATTG 840

CCAGCGATTC TTCTTTTGCC CCTTTTGTTT TCCAAAATTC AAGCAACCAA TACACTGGTA 900 

TTGATATGGA ATTGATTAAG GCAATCGCTA AAGACCAAGG TTTTGAAATT GAAATCACCA 960 

ACCCTGGTTT TGATGCTGCT ATCAGTGCTG TCCAAGCTGG TCAAGCCGAT GGTATCATCG 1020 

CTGGTATGTC TGTCACAGAT GCTCGTAAGG CAACTTTTGA CTTCTCAGAA TCATACTACA 1080

CTGCTAATAC CATTCTTGGT GTCAAAGAAT CAAGCAATAT TGCTTCTTAT GAAGATCTAA 1140

AAGGAAAGAC AGTCGGTGTT AAAAACGGAA CTGCTTCTCA AACCTTCCTA ACAGAAAATC 1200 

AAAGCAAATA CGGCTACAAA ATCAAAACCT TTGCTGATGG TTCTTCAATG TATGACAGTT 1260

TAAACACTGG TGCCATTGAT GCCGTTATGG ATGATGAACC TGTTCTCAAA TATTCTATCA 1320

GCCAAGGTCA AAAATTGAAA ACTCCAATCT CTGGAACTCC AATCGGTGAA ACAGCCTTTG 1380

CCGTTAAAAA AGGAGCAAAT CCAGAACTGA TTGAAATGTT CAACAACGGA CTTGCAAACC 1440 

TTAAAGCAAA CGGTGAATTC CAAAAGATTC TTGACAAATA CCTAGCTAGC GAATCTTCAA 1500 

CTGCTTCAAC AAGTACTGTT GACGAAACAA CGCTCTGGGG CTTGCTTCAA AACAACTACA 1560 
AACAACTCCT TAGCGGTCTT GGTATCACTC TTGCTCTAGC TCTTATCTCA TTTGCTATTG 1620

CCATTGTCAT CGGAATTATC TTCGGTATGT TTAGCGTTAG CCCATACAAA TCTCTTCGCG 1680

TCATCTCTGA GATTTTCGTT GACGTTATTC GTGGTATTCC ATTGATGATT CTTGCAGCCT 1740 

TCATCTTCTG GGGAATTCCA AACTTCATCG AGTCTATCAC AGGCCAACAA AGCCCAATTA 1800 

ACGACTTTGT AGCTGGAACC ATTGCCCTCT CACTCAATGC GGCTGCTTAT ATCGCTGAAA 1860 

TCGTTCGTGG TGGTATTCAG GCCGTTCCAG TTGGCCAAAT GGAAGCCAGC CGAAGCTTGG 1920 

GTATCTCTTA TGGAAAAACC ATGCGTAAGA TTATCTTGCC ACAAGCAACT AAATTGATGT 1980 

TGCCAAACTT TGTCAACCAA TTCGTTATCG CTCTTAAAGA TACAACTATC GTATCTGCTA 2040

TCGGTTTGGT TGAACTCTTC CAAACTGGTA AGATTATCAT TGCTCGTAAC TACCAAAGTT 2100 

TCAAGATGTA TGCAATCCTT GCTATCTTCT ATCTTGTAAT TATCACACTT TTGACTAGAC 2160 

TAGCGAAACG CTTAGAAAAG AGGATTCGTT AATGGCAAAA TTAAAAATTG ATGTAAATGA 2220

TTTACACAAG CACTATGGAA AAAATGAAGT CCTAAAAGGA ATTACGACTA AGTTCTATGA 2280

AGGAGATGTT GTTTGTATCA TCGGTCCTTC AGGTTCTGGT AAGTCAACTT TCCTCCGTAG 2340

CCTCAATCTT TTAGAAGAAG TCACTAGCGG TCACATCACT GTGAACGGCT ATGATTTAAC 2400

TGAAAAAACA ACCAATGTTG ACCACGTCCG TGAAAATATC GGCATGGTAT TCCAACACTT 2460

CAACCTCTTC CCTCATATGT CTGTATTGGA CAACATCACC TTTGCTCCTA TTGAGCACAA 2520

GTTGATGACT AAGGAAGAAG CTGAGGAATT GGGAATGGAG TTGCTTGAAA AGGTTGGACT 2580 

AGCAGATAAA GCTAATGCCA ATCCAGATAG CCTATCAGGT GGTCAAAAAC AACGTGTGGC 2640

CATCGCTCGT GGCCTAGCAA TGAATCCAGA CATCATGCTC TTCGATGAAC CAACTTCTGC 2700 

CCTTGACCCT GAGATGGTTG GAGACGTACT TAACGTTATG AAGGAATTGG CTGAGCAAGG 2760

CATGACCATG ATTATCGTAA CCCATGAGAT GGGATTTGCT CGTCAGGTTG CCAACCGCGT 2820

TATCTTTACT GCAGATGGCG AGTTCCTTGA AGACGGAACA CCTGACCAAA TCTTTGATAA 2880

CCCACAACAC CCTCGTCTGA AAGAGTTCTT AGATAAGGTC TTAAACGTCT AAACTCAAAC 2940

TGTAAGGATT TCCTTGCAGT TTTTCTACCT CGTATTGGAA TTTTTGATTT TTCGGAAAAT 3000 

TATGTTAGAA TTAAGTTTAT GAAATGAGGT TTCCTCATAC CTAGCAAGAC TAGGAATAAA 3060

AATAGAAATT AGGTAGCTAG ATGTCATCTA AGGTTATTGT TACAATTTTC GGTGCGAGTG 3120

GAGACCTGGC TAAACGCAAG CTCTACCCTT CCCTTTTTAG ACTATATCAA TCCGGCAATC 3180 

TTTCCAAGCA CTTTGCCGTT ATTGGAACTG CCCGTAGACC TTGGAGTAAG GAATATTTTG 3240

AATCTGTAGT TGTCGAGTCC ATCCTTGATT TGGCAGATAG TACCGAGCAA GCCCAAGAAT 3300

TTGCTAGCCA CTTCTACTAT CAAAGCCATG ATGTCAATGA TTCGGAACAT TATATTGCTT 3360
TGCGTCAATT ACAAGCTGAG CTTAATGAAA AATACCAAGC TGAACACAAT AAGCTCTTCT 3420

TCTTGTCTAT GGCACCTCAG TTCTTTGGAA CCATTGCCAA ACACCTCAAA TCTGAAAACA 3480 

TTGTCGATGG CAAAGGTTTT GAGCGCTTGA TCGTTGAAAA ACCATTTGGT ACAGATTACG 3540

CAACTGCAAG CAAGTTGAAT GACGAACTCC TAGCAACATT TGACGAAGAA CAAATTTTCC 3600 

GTATCGACCA TTATCTTGGT AAGGAAATGA TCCAAAGCAT CTTTGCAGTT CGCTTTGCAA 3660

ACTTGATTTT TGAAAACGTT TGGAACAAGG ATTTTATCGA CAATGTTCAA ATTACCTTTG 3720 

CGGAGCGCTT GGGTGTAGAA GAACGTGGTG GCTACTATGA CCAATCCGGT GCCCTCCGTG 3780

ACATGGTCCA AAACCACACT CTACAACTTC TTTCGCTCCT CGCCATGGAC AAACCAGCAA 3840

GCTTCACAAA AGACGAGATT CGTGCTGAAA AGATTAAGGT CTTTAAAAAC CTCTATCATC 3900 

CAACTGATGA AGAACTCAAA GAACACTTTA TCCGTGGGCA ATACCGCTCT GGTAAGATTG 3960

ATGGCATGAA ATACATCTCT TATCGTAGCG AGCCAAATGT GAATCCAGAA TCAACAACTG 4020

AAACCTTTAC ATCTGGTGCC TTCTTTGTAG ACAGCGATCG ATTCCGTGGT GTTCCTTTCT 4080 

TTTTCCGTAC AGGTAAACGA CTGACTGAAA AAGGAACTCA TGTCAACATC GTCTTTAAAC 4140

AAATGGATTC TATCTTTGGA GAACCACTTG CTCCAAATAT TTTGACCATC TATATTCAAC 4200

CAACAGAAGG CTTCTCTCTT AGCCTAAATG GGAAGCAAGT AGGAGAAGAA TTTAACTTGG 4260

CTCCTAACTC ACTTGATTAC CGTACAGATG CGACTGCAAC TGGTGCTTCT CCAGAACCAT 4320 

ACGAAAAATT GATTTATGAT GTCCTAAATA ACAACTCAAC TAACTTTAGC CACTGGGATG 4380 

AAGTTTGTGC GTCATGGAAG TTGATTGACC GTATTGAAAA GCTCTGGGCT GAAAATGGTG 4440 

CCCCACTTCA TGACTATAAA GCTGGAAGCA TGGGACCTCA AGCCAGCTTT GACCTACTTG 4500 

AAAAATTCGG TGCCAAATGG ACTTGGCAAC CAGATATCAC CTATCGTCAA GATGGTCGCT 4560

TAGAATAAAA AAATTTCCTG CAAGTTTATG CcTTGCAGGA TTTTTGCTTC TGATTAGATT 4620

AAACCTTCCA AGAGACCTTT CATAAAGTTT TCTGAGTTAA ACTCTCCAAT ATCATCGATT 4680 

TTTTCACCAA AACCAATCAA TTTTACAGGA ATATTGAGTT CTTCACGAAT GGCTAGAACC 4740

ACACCTCCTC GAGCAGTTCC ATCAATCTTA GTCAAAACAA TTCCCGTTAA AGGTGTGATT 4800 

TTCGAAAATT CTTTGGCCTG TACTAGGGCA TTTTGACCTG TTGATGCATC AAGTGCCAAG 4860 

AAGGTTTCAT GTGGTGCTTC TGGCACAACA CGTTTGATAA TACGACCAAT CTTTTCCAAC 4920

TCAGCCATAA GGTTATCCTT ATTTTGCAGA CGACCAGCAG TATCAATCAT GAGAATATCG 4980

ATACCTTCAG TCACGGCACG TTCCATACCA TCAAAGACCA CGCTGGCTGG ATCAGCTTTT 5040 

TCAGGTCCAG TTACTACTGG AACATCTACT CGTCGGCCCC ATTCAGCTAG CTGAGCTACT 5100 
GCACCCGCAC GGAAGGTATC TGCTGCAACC AGCATGACCT TCTTACCAGC TTGTTTGTAG 5160

CGGTGGGCTA GTTTTCCGAT AGAAGTTGTT TTCCCAACAC CATTCACACC AACAAAGAGC 5220

ATAACTGTCA AGTTATCTTG GAAGTGGATG CTTTCATCGT AGCTACCATC CTTTTCATAA 5280

AGCTCAACCA ATTTCTCAAT GATGACACGA CGAAGTACAT CAGGTTTCTT GGCATTTTCA 5340 

AGCTTGGCTT CGTAACGTAG TTCCTCCGTT AAGTTAGAAG CGACTTGGAC ACCAACATCA 5400 

CTCATAATCA GCAGTTCTTC CAGTTCCTCG AAAAATTCT'P CGTCAACAGA GCGGAAGTTA 5460

GCAAAGAAGG CATTCAAGCG GGCACCGAAA CCTGTGCGAG TTTTCTTAAG ACTGCGGTCA 5520 

TATTTTTCCT GAACAGTTTC TTCTGTTTGA GGAGCTTCTG GTTCAAGCAC TTCAGAATTA 5580

TTTTCTTCTA CAGTTCCTTC GTGCTCAAGC TTCTCTTCCT CTGGTAATTC TTCTGAGTTT 5640 

GGTAATTCTT CTATTTCTTC TTGAGAAACC CCTACAGCTG GCTCTGAATC CTGACTTTCT 5700

TCAACTGTGT CTTGGATTTC CTCTTCTTGG AACACAGCTT GTTCAACAAT TTCAACCTCT 5760

GCTTCTTCCT GAGAAACTTC CTCAACTTCT GTGAAGGTAG GATCAACATC TTCAGACAAA 5820

TCAAGATTTT CCAGAGCTTC TTTTACAACT TCTTCGATTT TAGGTTCTTC TTTTTTTCCG 5880

AATAGACGGT CAAACAATCC CATATCTTAG TTCTCCTTTA GCACATATTC TTCGATAGCC 5940

CAGGCGACAG CTTCCTCATC GTTGGTCATC GGCGTCACTA CATTTGCGGC TGCCTTTACT 6000 

TCAGGAACAG CGTTTTGCAT AGCAACACCA AGACCTGCCC ATTCAATCAT AGAGAGGTCA 6060

TTGGCCTCGT CACCACAAGC CATCACTTGA CTTTGGTCGA TTCCAAGATG GCTGATTAGT 6120 

TTTGCCAAAC CTGTTGCTTT ATGAACATTC TTTGGTGACC ATTCTAGCAA CATTTCACGT 6180 

GATTTAAAGA TTTCATATTG GTCAAACAAT TCTGGAGAAA TCTTCTGAAT GGCTGCATCC 6240

AAGGGTTCTT GAGCAAAGGC AGTCACGCAT TTGTTGTAGG TCATTTGACT AGATAAGTCT 6300

TCAAAGTCCA CTGGAACAAA GGTCAAAGCT GGATTGAATT TGGCATAAAG ACTTTCTTGG 6360

TCCGATTGGA TTTGATAAAC TGTTCCTTCT GAGATGGCAT CAAGAGGCAG TGATAATTTC 6420 

TCTGTTTCTT CATACAAACG TGCCACATCA TCATATGAAA AGACTGTTTT ATCAAGGATT 6480

TCTCCTGTAT TTTTCTGAAC TAATCCACCA TTAAAAGTAA TGGTATACTC ATCTTCCTGA 6540 

CCGTCAGTCC CTAACTCATG GAGAAAGAAA TCCATGGCTT TTAAGGGACG ACCAGTTGTC 6600 

AATACGACCT TGATACCACG ATCACGCGCA gCTTGCAAGG TTTCCTTGGT ACGATCCGTC 6660

AGCCTTTTAT CAGTAGTCAG CAAGGTCCCG TCCAAGTCCA ATGCAATCAA TTTTATATCT 6720 

GCCATTATAA GCCCTCCATA TAAGCTATAA CCGACCGTTC CTTATGGTGA CCAATCACAG 6780

TCTTTGCTAA TTCTAAAATT TCAGGTCGTG CATTTTCAGG AGCTACAGGA TGTCCCACAA 6840

CCTGCATCAT ATGTAAGTCA TTAAGATTGT CTCCAAAAGC CATGACCTGA TCCATTGTGA 6900
TACCAAGTTT TTTAACTAAT TCAACAATGG CCACTCCCTT ATCGACATAG TCCAGAACAA 6960

TATCAATGGA TTCAAAGCCA GTTGTCATGG CCTTAACACC AGGAACGTTT TCGTTTACCC 7020

AAGCCTCCCC ATCTTCCAGC GTTTCTTCTG TGAAGTTGGT TGTAAATTTG AAAATGTCAT 7080 

CTGTGATATC TTCCAAACTC GCTACTTTTT GGATATTTTC ATTATAGTGC TGACTCACTT 7140 

TCAAATAGGT CTCATCAACC GTATCTAGAA CATATGAACC CTTCTTACCC GTCAAGAGCA 7200

GTTTATTGAT ATCTACATAA GGTGAAGTTT TCAGCTTTTC AAAAGTTGCC AGATAAAAGT 7260

CACGAGACAT AGTCGCTTCA TACAAGTCCT GACCTTGATA CTCTACCAAA CTGCCATTTT 7320

CCGCGATGAA AATAATGTCA TCACGAACAC CAGCAAATAA TTTTTCTAGA GACAGAAATC 7380

CCCGACCCGA AGCTACCGCA AAGTAAATCC CTTTTTCCTT GTAGGAAACC AAGAGAGACT 7440 

TGAGACGATC CATATCAAAG CGTCCATTCC CATCTAGGAA GGTTCCGTCC ATATCCGTTG 7500 

CTACTAGTTT AATTGTCATC CTTCAATACT TTCTAAATCT TTTAACTTAA CTGAAACAAT 7560

CTTTGAAACA CCCGATTCTT GCATGGTCAC TCCATAGATG GAATCAGCCG CTGCCATGGT 7620 

TCCCTTACGG TGGGTTACGA CGATGAACTG GCTGTCCTTG TCAAAGCGGT TGAGGTAATC 7680

CCCAAAACGT TTAACATTGG CTTCATCCAG CGCAGCTTCC ACCTCATCCA AGATAACAAA 7740

TGGAATAGTC TTGACACGAA TAATGGAGAA GAGCAAGGCA AGAGCCGATA GGGCTTTTTC 7800 

ACCACCACTC ATGAGATTAA GAGACTGGAT TTTCTTGCCT GGTGGTTGGA CAGAAATTTC 7860

AACCCCAGCT GTCAGCAAGT CTCCTTCAGT CAAAATGAGG TCAGCCTGAC CTCCACCAAA 7920 

CATCTGCTTG AAGGTCACTT TAAAGGACTC ACGAATGACC TCAAAGGTTG ATTTAAAGCG 7980 

TTCCTTGACC TCATCATTCA TCTCTGTAAT GGTCTCAAGG AGCAGGTTTT TCGCAGACAA 8040 

AATATCATCA CGTTGGCTAT TTAGGAAATC CAGACGGTTG TGAACTTCTT CGTACTGTTC 8100 

AATAGCGTCT AAATTGACAG GACCCAGTGA GCGTATAGCC TTCTCTAAAT CCTTAACTTC 8160 

TTGCTCTGCC AGATTGAGAT TTTCCAACTC ATGCGCCTTT TCTAAAGCTT CTGTGTAGCT 8220

GATCTGGTAC TGGTCTGTTA ATTGACTTTG TAGATGGCGC AAGCGCTCGC TAACCTTTTC 8280 

TTTCTTGGCT TCAGCACGAG TTTGCTTGCG AATCCACTCT TCATTCTGCT GGCGAGCCTG 8340

ATCCAAATGA CTAGCAATAT CATCCAGTTG ACCCTCAATA TCATCCAACT CAAACTGCTT 8400

GCGAATCAAA CCTTGTTGGA GATTTGTTTT TTGAGTTTTG GATTCTTCCG CCTGTTGACT 8460

GAGCAATTCT GTATCAACCT TCTCAAGATT ATCAATCTTT TCTTGAAGAA GGCGCTGGAT 8520 

TTCCTCTTGT TCAAAATCAA GATTGTCCAA TTCCTTGCCT AAGCGTTCAA TATCAGCAAC 8580 

TTCATAACGT TTTTGCCCTT GCAGTTCTGT CTTAAGCAAA CGAGCTTGCG CTAGCTCTTC 8640 
CTGCAAGTTT TGATAGCGTT CTTGGATGGC ATTTTTGTTA GACTTAATCT CTTCAATCTC 8700

AGCTTCCAGA TTTTGCTTGT CACTGGAGAT TGCAGCAAGA CGCTCTTGGC AGTTTTCCTT 8760

ATCCGCTTGC CAATCTCCCT CGGAAAGACG ATCTATTTCC TCTTCTTGGA GTTTCCAAAG 8820 

AGTTTCCAGT TCTTCAACTT GCTGACTAGT TTGCTGATAA GCGAGGAACA AGCCTTGCTC 8880

CTGAATACGT GCCTGCTCTC CTTGAGATTT AATAGCTTCT AATGACTCGG TCAATCTGGC 8940 

CATCTCATCT TGCAAGGTCT TCAAAGTCGC CTCTTCTGAA CCCAAGCTTG CTTCTTCTTC 9000 

AGCAATTTCT TTTTGTAATT GCTCCAGTTC TGGCTTGATA AAAATGCTGT TATTCTGGCG 9060

ATTGGCACCA CCTGCATAAG AACCACCTGT GCGCAACTCT GTCCCATCCA ATGTCACCAT 9120

ACGAACCTGA TAACGAACTT GGCGAGCTGC TGCACGCGCA TGTTCTACGG TATCAAAGAT 9180 

AGCCGTCGTA GCTAGCAAGT TCTTGAAAAT GGCTTCCAGT CTAGTATCAA AAGTCACCAA 9240

CTCATCTGCC ATCCCAAGGA AACCTGGGCT TACAGCGATA GCATCTTGGT TCTGACTAGA 9300 

AATCGTACGC GCCTTGATAG TGGTCAAAGG AAGAAAGGTT GCACGACCGG CTCTGTTCCG 9360

TTTAAGGAAG TCAATAGCCT TGGTTGCCGA CTCTTCATCT TCTACGATGA TATGCTGGCT 9420

ACTTGCCCCT AAGGCAATCT CTAGGGCAGT TTGATAATAA ACATCAAAGG TCAGATGCTC 9480 

ACTGACTGCA CCAATAATCC CACCTAGGCG ATCTTTTTCT TGGAGAACAC TCTTAACACC 9540

TGCATAAAAG TTACTATGAT TTCTCAGGAT ATTTTCCAAA CTTTGAGCTC TGGCCTGCTT 9600 

GTTTTTGAGA TTATCCAGAC GGTCAAAGAG TTGGCTTTGT TGAGCTTGAT AGGAAGTTTT 9660 

CTGCTCCTCT TGCTCCTTGG CAATAGCTTG GTAGTCAGCC AATAATTTCT GAACCTGCTC 9720 

CTTGGCAGTT TCAAGCTCTT CCTTTTGCTG ACTAGCCTTC TCTTTAGCTA TAGCTAATTG 9780 

CTCTTTCAGC TTTTCTAGTT GATCTGCTTG TTTTTGAGAA AGCTGACGAC TATTTTCCAA 9840

CTCATTCTCA ATACGGGTCA ACTGGTTTGA GACATCCGCT TCTTCTTGTA AAAGAGCTAC 9900 

AAAGCGTTCA CGTAAGAGCT CAATCATCTG ATCAGGATCG TCTGAGAAAG CCAGCAATTC 9960

AGCTTCTAAA CGATTGAGTT TTTGATTATT TTGGACTAGA TTTCCCTCTA ACAGAGCTAA 10020

AGAGCTTTCT TTATCAGACT TTTCTTTGCT GAGTGAATTT CTCTTATCCT CCAAAGCAGC 10080

CAAACGGGCT TGTGCCTCCT GTTGATTCAA GGCCACTTGC TCGGACTCCA GTTTCGATAG 10140 

GGCTAATTTT CTTTCTAAAT CACTAATCAG ACTAGTCAAG TCCATCAAAC TGCCTTGGTC 10200

TTTGGCCATT TCAGCCTGTA AATCTTGGCG TTGCTTTTTA AGAGTTTGAT TTTCTTCTTC 10260 

TAATTTTTCA CGCTTTTGGT AATAACTCAT CAAGAGTTCT TGAACCTGAG TCAACTCTTC 10320

TTCTGTCGAC TCTAGTTCAG CCTTATTTTC CTTGATTTGA GCAACCAGAA CATCTAAATA 10380 

AATAGCCTTA CGTTGTCCTT CCAAGTCTAA AAACTTACGG GCATTCTCAG CTTGCTTCTC 10440 
AAGAGGCTTG ATTTGATTAT CCAACTCGTA GATAATGTCC TCTAAGCGGT CCAGATTATC 10500

CTGAGTTTGC TGCAGTTTAC TCTCGGTTTC TTTTCTGCGA GTCTTGTATT TTAAAACTCC 10560 

AGCAGCTTCT TCAAAAATAG CTCGTCGTTC CTCAGGCTTG GAATTAAAAA TCTCCTCAAC 10620

CTTCCCTTGG GAAATAATAG AGAAGGAATC TCGTCCCAAT CCAGTATCCA AGAAGAGGTC 10680 

ATGAATATCA CGCAGACGGA CTTTCTTGCC GTCAATCTTG TATTCGCTAT CTCCACTACG 10740 

ATAGACATGG CGTTCCACCC TGATTTCTTG ACCTGCATCC TTGATAAATC CGTCATGATT 10800 

ATCCAGAGTC ACAACTACAG AAGCATAATT GAGCGGTTTG CGACTTTCGG TTCCAGCAAA 10860 

GATGATATCC GGCATCTTGC CCCCACGGAG ACTCTTGACA CTAGACTCCC CCAAAGCCCA 10920 

ACGCAGACTT TCTGTAATAT TGGACTTTCC AGATCCATTG GGTCCAACAA CTGCCGTCAC 10980 

ACCTTGGTCA AAAACGACCT TGGTCTTATC AGCAAAAGAC TTGAACCCCT GAATTTCGAT 11040 

TTCCTTTAAA TACATGAATC CAGCCCCTTC TCAACGGCAT TTTTGGCAGC TTCCTGCTCT 11100 

GCTAATTTCT TAGAACGACC TTGGCCTTGA CCGATGCTCT TACCTTCAAC AAGAACTTCT 11160 

ACATCAAAAA CCTTATCGTG AGCAGGCCCT GTTTCAGAAA TCACCTGATA ACGAATAGCC 11220 

ACATCACCAT TGACCTGAAG CAACTCTTGG AGATGGGTTT TATAGTCTGT AATCATCTCA 11280 

AACTCGCCTG CTTCAACCTT AGGAATCATG ACTTGATAGA TAAATTCCTT GACCTTGGCC 11340 

ACATCCTTAT CCAAAAGAAG GGCACCAAGA AAGGCTTCAA AGGCATCACC AAGAATGGTG 11400 

TCACGATTGC GACCACCTGA TTTTTCTTCC CCTTTACCCA ACTTGATAAA CTGGTCAAAC 11460 

TGGCAATCAC GCGCAAAACC AGCTAAACTC TCCTCACGGA CAATCATAGC ACGGAGTTTT 11520 

GATAGGTCAC CTTCAGGCTT TTTAGGATAT TTTTTATATA GATATTCTGA AATCAATAAC 11580 

TGTAGAACAG CGTCTCCTAA AAATTCCAAG CGTTCATTGT GTGAAATTTT TAAGAGGCGG 11640 

TGCTCATTGG CATAACTCGT ATGAGTAAAG GCAGTTTCCA GTAACTTTTT GTCTGCAAAT 11700 

TCGATTGCAA AATGATTCTT TAGTACAGTT TGTAATTCTT TCATACCAAC CTCTTTCTAA 11760 

CTGATAATAG TCCTTTTTAT TATATCAAAA AAAGCCCCCT GAGTCACTCT AAAACGGGAC 11820

TGGAAAGCAT TTGGGAATTC TTTAGACAGA GATTCTCAGT TTTAGCGGCA AATTTGGGTC 11880 

AGGATAAAGA AAAAAGCCCT ATTAAAGGCT TTTTAGGATG TTTACATCCA CCCTGAGGGA 11940 

ATCGAACCCC CATCTCAAGA ACCGGAATCT TACGTGATAT CCATTACACT AAGGGTGGAA 12000

ACTTGTTTTA TTATAACAGA AATTTGCTCT AATAACAAGT TTTTTGGTCA AAGACCCCGT 12060 

CTTAGTGGGA AGCATCCCCA TTCCAGATGG AGTTTTTCAC GATCACATAA TCAACGTGTT 12120 

TAAGGTCAGC AACCTGACGT CCACCTGCAT AAGAAATAGC ACTTTGAAGG TCTTGTTCCA 12180 
TCTCAGTTAA AGTGTCTTGC AGATGACCTT TAGCAGGAAG CAAGATACGT TTGCCTTCCA 12240

CATTTTTGTA AGCACCTTTT TGATATTGTG AGGCTGAACC ATAATATTCT TTGAACTGTT 12300

CACCATCGAC TTCAATCGTT TTCCCTGGAC TTTCAATGTG TCCTGCAAAG AGGGAACCAA 12360

TCATGATCAT GCTAGCACCG AAGCGGATAG ACTTAGCAAT ATCACCGTGA GTACGAATTC 12420

CTCCATCAGC GATAATCGGT TTACGCGCAG CCTTGGCACA CCAGCGTAGA GCAGCCAACT 12480

GCCAACCACC TGTACCAAAA CCAGTCTTAA CCTTGGTGAT ACAAACCTTA CCAGGACCGA 12540

TTCCGACCTT AGTAGCATCC GCACCAGCAT TTTCCAATTC ACGCACAGCT TCTGGTGTTC 12600

CCACATTTCC AGCAATGACA AAGGTATCTG GCAATTCTTT CTTGATGTGT TGAATCATAG 12660 

AAATCACGCT ATCCGCATGA CCATGAGCAA TATCAATAGT GATATACTCA GGAGTATCAG 12720

CCTTGAGCTG GCTAACAAAA TCATACTCAT AATCCTTAAC ACCGACAGAG ATAGAAGCAA 12780

TGAGCCCTTG ATTGTGCATT CGTTTAATAA AAGGAATGCG TCCTGCCTCA TCAAAACGGT 12840

GCATAATGTA GAAGTAACCA CCTTTAGCCA GTTGCTCTGC TACATTTTCA TCCAAAATCG 12900 

TCTGCATATT CGCTGGCACA ACAGGTAGTT TAAAGGTGTG ATTTCCTAAA GTGACACTTG 12960 

TATCCGCTTC TGCACGGCTT TTAATGACAC ATTTATTTGG AATCAATTGA ATATCTTCGT 13020 

AATCAAAAAT TGGAAATTCA TTTAACATAT CGATGTCTCG TTTCTTTTGT AATGACCTAC 13080 

CTATGCTCTT GCATCACTAC GCCTTTTCCG ACGTTTCCTG G 13121 
(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9578 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 

CCGAATGCAA TGTTTACGGT TGAACTTGAA AATGGACATC AGATTTTAGC AACAGTTTCT 60 

GGTAAAATTC GTAAAAACTA TATTCGTATT TTAGCGGGAG ATCGTGTTAC TGTCGAAATG 120 

AGTCCATATG ACTTGACACG TGGACGTATC ACTTACCGCT TTAAATAATC GAAAAACTTG 180 

GAGGGATAAG AAATGAAAGT AAGACCATCG GTCAAACCAA TTTGCGAATA CTGTAAAGTT 240 

ATTCGTCGTA ATGGTCGTGT TATGGTAATT TGCCCAGCAA ATCCAAAACA CAAACAACGT 300 

CAAGGATAAG ATAGAAAGGA GAAAACATGG CTCGTATTGC TGGAGTTGAT ATTCCAAATG 360

ACAAACGCGT AGTAATCTCA TTGACTTATG TTTATGGTAT CGGACTTGCA ACATCTAAGA 420 

AAATTTTGGC TGCTGCTGGA ATCTCAGAAG ATGTTCGTGT ACGTGATCTT ACATCAGATC 480 
AAGAAGATGC TATCCGTCGT GAAGTGGATG CAATCAAAGT TGAAGGTGAC CTTCGTCGTG 540

AAGTAAACTT GAACATCAAA CGTTTGATGG AAATCGGTTC ATACCGTGGT ATCCGTCACC 600 

GTCGTGGACT TCCTGTCCGT GGACAAAACA CTAAAAACAA CGCCCGCACT CGTAAAGGTA 660 

AAGCTGTTGC GATTGCTGGT AAGAAAAAAT AATATAGGAG GTAAAAGTCT TGGCTAAACC 720

AACACGTAAA CGTCGTGTGA AAAAGAATAT CGAATCTGGT ATTGCTCATA TTCACGCTAC 780 

ATTTAATAAC ACTATTGTTA TGATTACTGA TGTGCATGGT AATGCAATTG CTTGGTCATC 840 

AGCTGGTGCT CTTGGTTTCA AAGGTTCTCG TAAATCTACA CCATTCGCTG CTCAAATGGC 900 

TTCTGAAGCT GCTGCTAAAT CTGCACAAGA ACACGGTCTT AAATCAGTTG AAGTTACTGT 960 

AAAAGGTCCA GGTTCTGGTC GTGAGTCAGC TATTCGTGCG CTTGCTGCCG CTGGTCTTGA 1020 

AGTAACAGCA ATTCGTGATG TGACTCCAGT GCCACACAAT GGTGCTCGTC CTCCAAAACG 1080 

TCGCCGTGTA TAATCATCGC ATTACACTGC TTTTCGTTTA AGAGGGAGTA ACTAAATGAT 1140

CGAGTTTGAA AAACCAAATA TAACAAAAAT TGATGAAAAT AAAGATTATG GCAAGTTTGT 1200

AATCGAACCA CTTGAACGTG GCTACGGTAC AACTCTTGGT AACTCTCTTC GTCGTGTACT 1260

TCTAGCTTCT CTACCAGGAG CAGCTGTGAC ATCTATCAAC ATTGATGGTG TGTTACATGA 1320

GTTTGACACA GTTCCAGGTG TTCGTGAAGA CGTGATGCAA ATCATTCTGA ACATTAAAGG 1380

AATTGCAGTG AAATCGTACG TTGAAGACGA AAAAATCATC GAACTGGATG TTGAAGGTCC 1440 

TGCTGAAGTA ACAGCTGGTG ACATTTTGAC AGATAGCGAT ATTGAAATTG TAAATCCAGA 1500 

TCATTATCTC TTTACAATCG GTGAAGGTTC TTCTCTAAAA GCGACTATGA CTGTTAACAG 1560

TGGTCGTGGA TATGTACCTG CTGATGAAAA TAAAAAGGAT AATGCACCAG TTGGAACACT 1620 

TGCTGTAGAT TCTATTTATA CACCAGTTAC AAAAGTCAAC TATCAAGTGG AACCTGCTCG 1680 

TGTAGGTAGC AATGATGGTT TCGACAAATT AACCCTTGAA ATCTTGACAA ATGGAACAAT 1740

TATTCCAGAA GATGCTTTAG GGCTTTCAGC ACGTATTTTG ACAGAACATC TTGATTTGTT 1800 

TACAAATCTT ACTGAGATTG CTAAGTCAAC TGAAGTGATG AAAGAAGCTG ATACTGAATC 1860

TGACGACCGT ATTTTAGATC GTACGATTGA GGAACTGGAC TTGTCTGTGC GTTCATACAA 1920 

CTGTTTAAAA CGTGCCGGTA TCAATACTGT GCATGATTTG ACAGAAAAAT CTGAAGCAGA 1980

GATGATGAAA GTACGAAATC TTGGACGCAA GAGTTTGGAA GAAGTGAAAC TCAAACTCAT 2040 

TGATTTGGGT CTTGGATTAA AAGATAAATA AAGGAGGAAT ACATGGCTTA CCGTAAACTA 2100 

GGACGCACTA GCTCACAACG TAAAGCAATG CTTCGCGATT TGACAACTGA CCTTTTGATC 2160 

AACGAATCAA TCGTGACAAC TGAAGCTCGT GCTAAAGAAA TCCGTAAAAC TGTTGAAAAA 2220 
ATGATTACTC TAGGTAAACG TGGTGATTTG CATGCACGTC GTCAAGCAGC TGCTTTCGTA 2280

CGTAATGAAA TCGCATCTGA AAACTATGAT GAAGCAACTG ATAAGTACAC TTCTACTACA 2340

GCACTTCAAA AATTGTTCTC AGAAATCGCA CCTCGTTATG CTGAACGTAA CGGTGGATAC 2400

ACTCGTATCC TTAAAACTGA ATCACGTCGT GGTGATGCAG CGCCAATGGC GATCATCGAA 2460

TTAGTATAAA ATCATCAATT TTGTTGAGTG TTATGATGAT GGAGTCTTGT GCTCTTAGTC 2520 

TAGCTCTGGT CTACCGCTAG GATTTCGGTC CTAGCGGGAA CACTCATCAT AAGTTGGGAT 2580

AGTAGACGCT TGTTTACGAA ATTGTTTTTT TCTTAAGAAC AACTTCGTAA GCAGGCGTTT 2 640 

TTGAGTATTT TCGTTAGAAT TATGCTATAC TATTTGAAAA GAATCCTGTT TAATGTTAAG 27 00 

GTTTCTTATT TTAAGAAGAA TTGGAGTTTA CTTATGAAAG CCATTATAAC TGTTGTTGGT 2 7 60 

AAAGATAAAT CTGGAATTGT TGCAGGTGTT TCTGGTAAAA TTGCAGAATT AGGATTGAAT 2 820 

ATTGACGATA TCTCTCAAAC TGTCTTGGAT GAATATTTTA CGATGATGGC TGTTGTATCT 2 880 

AGTGATGAAA AGCAAGATTT TACCTATCTT CGTAATGAAT TTGAAGCTTT TGGGCAAACT 2 940 

TTGAATGTAA AAATCAATAT TCAGAGTGCA GCGATTTTCG AAGCTATGTA TAATATCTAG 3 000 

GAGGTCATCA TGGATATTAG ACAAGTTACT GAAACCATCG CCATGATTGA GGAGCAAAAC 3060 

TTCGATATTA GAACCATTAC CATGGGGATT TCTCTTTTGG ACTGTATCGA TCCAGATATC 3120 

AATCGTGCTG CGGAGAAAAT CTATCAAAAA ATTACGACAA AGGCGGCTAA TTTAGTAGCT 3180 

GTTGGTGATG AAATTGCGGC TGAGTTGGGA ATTCCTATCG TTAATAAGCG TGTATCGGTG 3240 

ACACCTATTT CTCTGATTGG GGCAGCGACA GATGCGACGG ACTACGTGGT TCTGGCAAAA 3300

GCGCTTGATA AGGCTGCGAA AGAGATTGGT GTGGACTTTA TTGGTGGTTT TTCTGCCTTA 3360

GTACAAAAAG GTTATCAAAA GGGAGATGAG ATTCTCATCA ATTCCATTCC TCGCGCTTTG 3420 

GCTGAGACGG ATAAGGTCTG CTCGTCAGTC AATATCGGCT CAACCAAGTC TGGTATTAAT 3480

ATGACGGCTG TGGCAGATAT GGGACGAATT ATCAAGGAAA CAGCAAATCT TTCAGATATG 3540 

GGAGTGGCCA AGTTGGTTGT ATTCGCTAAT GCTGTTGAGG ACAATCCATT TATGGCGGGT 3600

GCCTTTCATG GTGTTGGGGA AGCAGATGTT ATCATCAATG TCGGAGTTTC TGGTCCTGGT 3660

GTTGTGAAAC GTGCTTTGGA AAAAGTTCGT GGACAGAGCT TTGATGTAGT AGCCGAAACA 3720 

GTTAAGAAAA CTGCCTTTAA AATCACTCGT ATCGGTCAAT TGGTTGGTCA AATGGCCAGT 3780

GAGAGACTGG GTGTGGAGTT TGGTATTGTG GACTTGAGTT TGGCACCAAC CCCTGCGGTT 3840

GGAGACTCTG TGGCACGTGT CCTTGAGGAA ATGGGGCTAG AAACAGTTGG CACGCATGGA 3900

ACGACGGCTG CCTTGGCCCT CTTGAACGAC CAAGTTAAAA AGGGTGGAGT GATGGCCTGC 3960

AACCAAGTCG GTGGTTTATC TGGTGCCTTT ATCCCTGTTT CTGAGGATGA AGGAATGATT 4020

GCTGCAGTGC AAAATGGCTC TCTTAATTTA GAAAAACTAG AAGCTATGAC GGCTATCTGT 4080

TCTGTTGGAT TGGATATGAT TGCCATCCCA GAAGATACGC CTGCTGAAAC TATTGCGGCT 4140 

ATGATTGCGG ATGAAGCAGC AATCGGTGTT ATCAACATGA AAACAACAGC TGTTCGTATC 4200 

ATTCCCAAAG GAAAAGAAGG CGATATGATT GAGTTTGGTG GTCTATTAGG AACTGCACCC 4260

GTTATGAAGG TTAATGGGGC TTCGTCTGTC GACTTCATCT CTCGCGGTGG ACAAATCCCA 4320 

GCACCAATTC ATAGTTTTAA AAATTAAGAA AATAGGAGAA ATTTTAAGTT CTATTTAAGA 4380 

TTAGACGTGT ATACTATAAT CATTAAATAA AGACCTCCTA ATATTATTTG AAACAGATAA 4440 

CACTGAATTA GTTTGAATTT GATTTTCATC TAATATCTTT ATTTAATGAA CTCCTAAACT 4500 

TTTTCATAAT AATCTCCTTC AAAAGTCGCC TGTATGGGTG GCTTTTATTT TATCATTCAT 4560 

GATATAATAG AAGCAAACGG AGGACGGAAA ATGGTAAAAG TACGATTGTA TTTGGTACGT 4620

CATGGCAAGA CCATGTTTAA CACGATTGGT CGCGCGCAAG GTTGGAGCGA TACTCCCTTA 4680 

ACTGCTGAAG GTGAACGAGG GATTCAAGAG TTAGGAATCG GTTTGCGAGA ATCTGATCTA 4740

CAGTTTGAGC GTGCTTATTC GAGTGATTCT GGTCGTACCA TTCAGACCAT GGGAATTATC 4800

CTTGAAGAAC TTGGCTTGCA GGGGGAAATC CCTTATCGCA TGGACAAGCG TATCAGAGAA 4860 

TGGTGTTTCG GTAGTTTTGA TGGAGCCTAT GATGGCGATC TTTTCATGGG CATTATTCCT 4920

CGTATCTTTA ATGTGGACCA CGTTCACCAA TTGTCTTATG CTGAACTGGC TGAGGGCTTG 4980 

GTAGAGGTCG ATACAGCTGG TTGGGCTGAA GGCTGGGAAA AACTCAGTGG CCGAATCAAG 5040 

GAAGGCTTTG AAATGATTGC AAAAGAAATG GAAGATCAAG GTGGAGGTAA CGCCCTTGTT 5100 

GTCAGCCATG GAATGACTAT TGGAACCATT GTTTATCTGA TTAATGGCAT GCATCCGCAT 5160 

GGTCTGGATA ATGGTAGCGT GACAATCCTT GAATATGAGG ACGGCCAGTT TAGGGTTGAA 5220 

GTTGTCGGTG ACCGTAGTTA CCGAGAGCTA GGACGTGAGA AGATGGAAGA AGGCTCTATT 5280 

TAATCAGTCT AGACTTGCTT GCCATGAGCT AGGGATTTGA TAAGAATATC AAGATAAGAA 5340 

AAAACAGCCG AGGGCACTCC TTTCGGCTGT TTTTGATGTG GAAAACTAAA GTGTAATGCT 5400 

ATTGCTTTTA GAGATTTTCA TAAACAAGAG CAAGGAACCT ACTGTTAGAA CAGTCAGGAT 5460 

AGTTGACAAG GTTGCGGCTA CACCGTAATT TCCTCTGAGA ACCTCTGTAT AAATAGCTAC 5520 

AGTCATTGTT CTTGTTTTGA CATTGTAGAG GAGGATAGAA GTAGAGAGTT TTGAAATCAT 5580 

TGTGACTCAA GATAAGATGG CTCCAGAAAT GATACCAGAT AGCATCATTG GAGTTGTAAT 5640 

CTTAGCAAAG GTATTGAGAC GACTACTTCC TAAGCTTTCA GCAGCTTCTT CAATACTTGG 5700 

TGCTATTTGT TGTAAGCTAG CAACAGATGA GCGAATAGTA TAAGGTAATC TTCTGGCAGA 5760

TAGAGACATA ATCAAGATGA AAGCAGTCCC TGTAATCATA AGAAATCCAC TTCCAAATAG 5820

ACCAGTATTG AAGGAAGAAA TGAAGGCAAT CCCTAGAACG GTTCCTGGTA CAATATAAGG 5880

TACCATACTG AGGCTGTCAA TTAAGTTTGT AAACAAATTC CGTTTTCTAA CGGCTAGGTA 5940

GGAGATAAAT GTCGCAAATA GAACAACTAG AACTAAGGCA ATCAAAGGGA TACGAATGGT 6000 

ATTGAAAATA GCAGATCCCA TACGATGGAA AGCTACCTTG TAACTGTTTG GAGAATAACC 6060 

TTTAACAGAT ACCATACCTG ATGTTTTTAG GAAAGAGGTA TAAATTAAGT AGATTTGAGG 6120

TAAAACAGAG ATAAAGATAA TTCCGTAGAC TGTTGCATAA ATGGCAGCCA TTTTTCCTTT 6180 

TGTAGTTTTT TTAGGCTCAA TTGGATGGAG CAGATTCATG CTGAAACTGT AGCGGTTTGC 6240

AATGTGTTTT TGGATAAGGA AAATTGCCAA GGCAATGATA ATCGCCATAA TTGCAAAAGC 6300 

AGAATTTCCT CCAACCTCGC TAATAAATTG GGTATAAATC AGGACAGGGA AAGTCCGATA 6360

CCCTTCGCCA ATCAACATAG GCGTTCCAAA GTCTGAGAAT GCTCTCATAA ATACAAGCAA 6420

GGAGCTGCTA GTAAGGTTGG AACTAGGAGA GGTAAAACAA CCGTTACGAT AGGTTTAAAT 6480

CCGAAGGACC CCATGCTTTC AGCTGCTTCA AGTAGAGAAT TGTCAATACT GTTCATTGTT 6540

CCAGCAACAT ATAGAAATAC CAGTGGGAAT AGTTGCAGTG TAAAGACAAG TACAATTCCT 6600

TTGAATCAAT AAATATCGAT AGCTGGAAGA TAAAGGGCAT TTGTCAAAAA TTTAGTGATG 6660 

ACCTCATTTC GTCCTAGCAA GAGAACCCAG GAGTAGGCTC CTACGAAAGG AGCTGACATG 6720

GAAGCAATGA TAATCAATAT TTGTAGAAAT TTCTTCCCCT TGAAGTCATA CATAGAGAAG 6780

AGATAAGCTA ATAGGGTTCC TACAACTAAG GAAGTGATAG TAGCGGTAAT GGAAACCTTG 6840

AAACTGTTGA CTAGTGTCTC AGAGTAGTAG GCTTTACTAA AGAAAGTGAC AAAATTAGCT 6900 

AGTGAGAATT GTCCTTCATG TATAAGTGCT TGCTTGAGCA CGGTAACGAT AGGATAAACG 6960 

AGAAAGATAG GATAGGTAAG AAAGAGGAAG AAAGAGGAAA CTGTCCAAAT ATTTAGTTTT 7020

TTACGTTCCA TGGTTGACTC CTTTTATCAG GTTTTGGGAA CCATCTGCAG AAAAGATGTT 7080

TAATTTTTGC GTATTGATTC GTAGACGAAT ACGATTGCCT TTTTGTAGAT CTTCTTCAAA 7140

AGTTGATTCT TCACTAACTT GAATTTTTGA GGCAAAACCT GTCTCAATGA AATAATCCGT 7200

ATTTAGTCCA AGATAGACGC TATCTCTAAT AGTTCCTTCA ATATCTCCAG ATTCATCTTT 7260

GATAAACTCT TCGGGACGAA TGCTTACATG AATAGCTTGC TCCTCAACCT GATCAAGAGC 7320

TGGCATTCGA AGGGCATAGC CATCTGAAAA GACGATATAA GCGCCGTCGC TCCGTTTTTC 7380 

AAGATTGGCA GGGATAATAT TTGTGCGTCC GATAAAGGTT GCCACAAACT CATTAGCTGG 7440 

TTTATGATAG AGTTCTTTTG GTCGGCCGAT TTGTTGGATC ACCCCATCTT TCATAACAGC 7500

AATTTGGTCT GAAATAGCCA TGGCTTCTTC TTGGTCGTGG GTTACATAAA CAGTTGTAAT 7560

TCCCACTTCG TGTTGGATTT CTCGGATGGC TTGACGCATA TCCAAGCGAA GTTTGGCCTC 7620

CAGATTACTA AGTGGCTCGT CCATGAGGAG AACACTTGGA TTAACCGCTA AGGCGCATGC 7680

CAAGGTGACA CGTTGTTGTT GTCCACCACT GAGTTTATCG GGCTTTCGAT CCGCATATTG 7740

AGCAATTTGC ATGAGTTCAA GATACTTGTT GGTCTGTTGA ATCAATTCTT CTTTTGGAAC 7800 

CTTCTTTTGC ATAAGACCAA AAGCAACGTT GTCTCGGACA GTCAAATGTG GGAAAATAGC 7860

GTAGTTTTGG AAAACCATCC CGATATTGCG TTTGCTGGGT TCCATATTAT TGATTTTTGT 7920

ATCATCGAAG TAAAATTCTC CACCTTCGAT ACTGTTGAAA CCTGCAATCA TACGAAGAAG 7980 

GGTCGTTTTC CCACATCCTG AAGCTCCAAG AAGGGTAAAG AGACTTCCTT TTGGAATTGT 8040

AATGTTCAAA TTCTCAATAA CAGGGACATC GTGGTAGATT TTTTTGGCGT TAATAATTTT 8100 

GATCTCACTC ATAGTGAACC TCTTTTACTG TTTAGATTGG ATATCTGTAA AGACTTCGTT 8160 

GTATTTCTTA ACGATATCTG ATTTATTCTT GATGACATAA TCATAATCTT CAGTGAGTGT 8220

TTTGATTTTG TCAATTGGTT TCATGTTTTC GCTTGTTTTA GCATTTTTAC GAACAGGACG 8280

GTTAGTAGTG GTTGTACCAA GTGTATCTTG TACTTCTTGA GAGATAATAA AATCGATAAA 8340

TTTCTTGGCA TTTTCCATAT TTTTAGATTT TTTAACGATA GCAGCACTAG CAGGTAGGAA 8400

GACGGTTCCT TCTTTTGGAT AGACTACCTT AATGTTAGCT CCGTCATTTA AGAGTTTAAC 8460 

TGCTGGATCT TCATAAGAGA GACCAACAGC CATTTCTCCA TCAGCGACTA CTTTATAGAC 8520 

ACTAGATGAA CTTGAACCGA TTTTACCATC AATAAGTGTG AAAAGATCTT TTACATAAGA 8580 

CCAAGCCTTA TCATCTTTGT AACCACCTTG AGCTTGTAGC ATATTTGTTA ATTGAGCAAA 8640 

GGCGCTAGAA GAGTTTGCTG GGTCAGCAGT TGCGATTTTT CCTTTTAGTT CAGGTTTGAA 8700 

AAGATCGTTA TATCCTTCGA TGTTCATGCC TTTAGTTAAA TCAGGGTTGA CGATTAAAAC 8760 

ACTACCATCT AGTGTATAAG GAGTAGAGTA GCCAGTTGTG TTTTGATATT CTTTGATAAC 8820

ATTATCATTT TCTTTTGAAG TATAGTTTTC AAAGAGTTCT CCGTGGGTAG TATATTGTGT 8880 

ATAAGAACCA CCAAAGATAA CATCAGCTAC AGGAACTTCT TTTTCTGACT CTAGTTTTTT 8940

GAAAAGTTCT CCAGTACCAG CTTGAATCAG TTCTACTTTG ATACCATATT TTTCTTCAAA 9000 

GGCAGGAATA GTTGCTCCAA TTAAGCCCTC TGAGTTTGGT GAATAAACGA CTAGCGAACC 9060 

GCCGTCTCCT TTATCAGATG AACTGTCATC GGCAGATTCA TTAGAAGAAC AAGCAGCATA 9120 

ATACATCCAT TTCTTTTTCA TGATGGATAC CTCCGTTGTG TTATTTAAGT TTATTTTAAA 9180 

ACAATGTAAG CGTTTTTAAA ACATACAATT CTATTCTATA GTGTATTGAA TCTATAACAG 9240 

TACACTTTGA CTGCTAAAAT ATTTCTATAA ATTAATTTGA CTTTCCTGAT AGAGATGTTC 9300

ACATCTTATT TCAATTCACT ATATTAGAGT AAAATTCTCT ACAAAAAGAA GAATAGCCTA 9360

TTTTACTATT CTTCTGAGTG ATTTCAATTC CTTTGGGGAA ATATGGAGAT ACTTTTTAAA 9420

TCCTGACAAA TGGTTGTTTC TTTTTCTAAA TCGGTGATAC TGTATCGGAG AATGCGCGTG 9480 

AGGTCACAAA GGCTGCGATA GAGCTTCTAT GGAGAATTTC TTTTTGGAGA GATTTTTTAA 9540

AGGAATGAGA CATCCGCTAC CTCCTTGGAA GGTTTTTG 9578
(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13440 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

CGGGCTGTTG TGACGATTCT TATTTCTATC TGTGTTATCT TTTTGGGAAC TATTTTGGGT 60 

GTTGTCTTGG CTTTTGGGCA ACGTTCAAAG TTTAAACCGC TTGTTTGGTT GGCCAACTTG 120

TACGTTTGGA TTTTCCGTGG GACACCGATG ATGGTTCAAA TTATGATTGC CTTTGCTCTT 180

ATGCATATCA ATGCTCCGAC TATTCAGATT GGAATTTTAG GTGTTGATTT TTCGCGTCTG 240

ATTCCAGGGA TTTTGATTAT CTCTATGAAT AGTGGTGCTT ATGTTTCGGA GACTGTTCGT 300 

GCCGGAATCA ATGCGGTTCC AAAAGGTCAG CTAGAAGCGG CTTATTCGCT AGGGATTCGT 360 

CCTAAAAATG CGATGCGTTA TGTGATTTTG CCACAAGCAG TCAAAAATAT CTTGCCAGCA 420 

TTGGGGAACG AATTTATCAC CATTATCAAG GACAGCTCCC TCTTATCAGC TATTGGGGTC 480 

ATGGAGTTGT GGAATGGGGC TACAACACTT TCTACAACAA CCTATCTACC TTTAACACCA 540

CTTTTATTTG CAGCATTTTA CTACTTGATT ATGACCTCTA TTCTGACAGT AGCCTTGAAA 600 

GCTTTTGAAA AACATATGGG ACAAGGAGAT AAGAAATAAT GACAGAAACC TTGATAAAAA 660 

TTGAAAATTT ACATAAATCC TTTGGAAAGA ATGAAGTATT GAAGGGCATC AACCTCGAGA 720

TTAAAAGAGG AGAAGTTGTC GTTATCATCG GTCCTTCAGG GAGCGGGAAA TCTACCTTGC 780 

TTCGCTCTAT GAATTTGTTG GAAGAAGCAA CCAAGGGGAA GGTTATCTTT GAGGGAGTCG 840

ATATTACGGA CAAGAAGAAT GACCTGTTTG CCATGCGTGA GAAGATGGGC ATGGTTTTTC 900 

AACAATTCAA TCTCTTTCCT AATATGACTG TGATGGAAAA TATCACCTTG TCCCCTATCA 960 

AGACCAAAGG TGACAGTAAG GCCGTTGCAG AGAAAAGAGC TCAGGAACTT TTGGAAAAAG 1020 

TTGGTTTGCC AGATAAGGCA GACGCTTATC CACAGAGTTT GTCAGGTGGC CAGCAACAGC 1080 

GGATTGCCAT CGCGCGTGGG TTGGCTATGG AACCAGATGT TTTGCTCTTT GACGAGCCAA 1140

CTTCAGCCCT AGATCCTGAG ATGGTTGGAG AAGTTCTGGC TGTTATGCAA GATCTAGCCA 1200

AGTCAGGAAT GACCATGGTT ATCGTAACAC ATGAGATGGG ATTTGCCCGT GAGGTGGCAG 1260

ATCGTGTCAT CTTTATGGCA GACGGTGTGG TTGTTGAAGA CGGAACACCT GAGCAGATTT 1320

TTGAACAAAC CCAAGGACAA AGGACTAAAG ACTTCTTGAG TAAGGTTTTA TAAGTTAGCT 1380 

TTGTTTAGCT ATTTGTAGCC AGCTTTAAAC GTTAAAGAGA AGATTAGTGA AAAGCTCAAC 1440

CAGAGCTTTT TCTTATAGTT TAAAGCTATA GGATTGCCTA GGAAAGAAGT GTTAGAGCTA 1500 

CATTGTATTT TTTGGTATAA TTAAAGATAT TTGTAAGAAA AGAGAAGTGA TATGACACAG 1560 

ATTATTGATG GGAAAGCTTT AGCGGCCAAA TTGCAGGGGC AGTTGGCTGA AAAGACTGCA 1620

AAATTAAAGG AAGAAACAGG TCTAGTGCCT GGTTTGGTAG TGATTTTGGT TGGGGACAAT 1680 

CCAGCCAGCC AAGTCTACGT TCGCAACAAG GAGAGGTCAG CCCTTGCGGC TGGTTTCCGT 1740

AGCGAAGTAG TACGGGTTCC AGAGACCATT ACTCAAGAGG AATTGTTAGA CCTGATTGCT 1800 

AAATACAATC AGGATCCAGC TTGGCATGGG ATTTTGGTTC AGTTGCCATT ACCAAAACAC 1860

ATTGATGAAG AGGCGGTTCT ATTGGCTATT GACCCAGAAA AGGATGTGGA TGGTTTCCAT 1920

CCTCTAAACA TGGGGCGTCT TTGGTCTGGT CATCCAGTCA TGATTCCTTC GACACCGGCA 1980 

GGAATTATGG AAATGTTCCA TGAATATGGG ATTGACTTGG AAGGTAAAAA TGCAGTCGTC 2040

ATCGGTCGAT CCAATATTGT CGGAAAACCT ATGGCCCAGC TTCTTTTGGC AAAGAATGCA 2100 

ACAGTAACCT TGACTCACTC ACGTACTCAT AATCTTTCCA AGGTGGCTGC AAAAGCAGAT 2160 

ATTCTGGTTG TTGCAATCGG TCGTGCCAAG TTTGTGACTG CTGACTTTGT CAAACCAGGT 2220 

GCGGTAGTCA TTGACGTTGG GATGAACCGC GATGAAAATG GTAAGCTCTG TGGGGATGTT 2280

GATTATGAGG CGGTTGCCCC ACTTGCTAGC CACATTACGC CAGTCCCTGG AGGTGTCGGT 2340

CCTATGACCA TTACTATGCT GATGGAGCAA ACCTATCAGG CAGCACTTAG GACATTGGAT 2400 

AGAAAATAAG ATAAAAATTT TCTGAGGAAA GTGTATTTTC TATAGCTATA TCTAAAATGA 2460 

TAGAAATGAA TATTAAATTT TAGAAATAAG TTTATAAAAG GAGGTTTGCG CCTCCTTTTT 2520

GTTGTATAAT GGAGTGAGGT GATTAGATGA TTTTAAAAAT TTATAATGGG GAATATAGTT 2580 

TACAATGGGA TGGAATATAC TACTTAGCAC TAATTGATTA TCCAAATATT CAAGAGTGGG 2640

AATTAGAAAA AATTGCTAAA TTTATAGCTT ACGAAAAACT TCATAAACGT CAAACAAGTA 2700 

TTGAGTGTGC TGATTCTTGT TTAAAAAAAG AAATTTTAGA TTACATCTGT CAGCATCCCT 2760

TTCTGCCACC ATTTACTCCT ACAGATAAAA GAGTAGCCTC GACTTATGAC CTACATAAGA 2820 

GGTTAGTGAC TTCAGACTAC TGTAGTCATA CTACGACTAT AGATGCAGCG ATTTCTATTT 2880

TTAAAACTGG TCGTCTTTTA TCTGCTGTGA AAGCCTTTGG GCGAGATGCT GAGGAGTTGG 2940

TTTTGGATAG TCGAAATGCT GCATCTGATC CGATAGATTA TTTTGACTAT GTCATGTTAG 3000

GGTGGTCAAA TACAAGTTCT GGTTATCGAT TGGCGATGGA GCGTTTATTA GGTCGAGCTC 3060

CTTCAGAGAA AGAATTACAA GACAAGTTTA TTCCTGGAGT AAGTTTTCAT TTTATCTATA 3120 

CAGATTTGAT TAAAGTTCCT GGTTATATTT TTGATGGTTA CCATGCTGTA AAAATTAAGG 3180 

ACATGCTTAA TTTATTAAGT GAGTTGTATA TTTGCATTAT TCCAACTCAT AATAAGAGCC 3240

AATTTGAAAA TATTATTCCA ACCAAAATAC AAGATAGGGT GTATTATCTT GACTATGCTG 3300

GAGAAGACTT AGAAGAGTGG ACTAAGAAAG TCTATCAAGT TGTTTTAAAA CAATCAGATA 3360

AAGGATAGTT GAGGAAAAAA CGATGAAAGT GATTGATCAA ACCTTACTAG AAAAAGTCAT 3420

TATTGAACGT TCTTGTACAA GTCATAAAGG AGACTACGGT CGTCTGCTGT TGCTTGGTGG 3480

GACTTATCCT TATGGTGGTG CCATCATCAT GGCTGCTTTA GCAGCTGTAA AAAGCGGTGC 3540

AGGATTGGTA ACCGTTGGAA CGGACAGGGA AAATATCCCT GCTCTACACA GCCATTTGCC 3600

TGAGGCTATG GCCTTTTCTC TGCAAGATCA GTAATTGTTA CAAGAGCAAT TGGAGAAGGC 3660

AGAAGTTGTC TTGCTGGGGC CTGGTTTACG AGACGATACG TTTGGAGAAA ATCTTGTAAA 3720

ACAGGTCTTT GCTAGCTTAA AAAAGAATCA GATTTTGATT GTAGATGGAG GGGCCTTAAC 3780

CATCCTTGCT AGGACAAGTT TGTTGTTTCC ATCTAACCAG CTTATCTTAA CTCCCCACCA 3840

AAAAGAATGG GAAAAACTGT CTGGTATTGC TATTGAAAAG CAAAACGAAG GTACAACATC 3900

TAGTGCCCTG ACTTCTTTCC CTCAAGGAAC AATTTTGGTA GAGAAAGGTC CAGCTACTCG 3960

TATTTGGCAA GTTGGCCAGT CTGATTATTA CCAGTTAAAG GTTGGCGGTC CCTATCAGGC 4020

GACTGGTGGT ATGGGTGATA CACTGGCTGG AATGATTGCA GGATTTGCAG GCCAATTTCG 4080 

ACAGGCCAGT CTCTACGAAC GTGTGGCAGT AGCAACCCAT CTTCATTCAG CCATAGCCCA 4140 

AGAACTATCT CAAGAAAATT ATGTGGTCTT GCCGACGGAA ATTAGTAATT GTCTTCCTAA 4200

AGTAATGAAA AGATATGTCT AAAATAGTTA GACAAAAAAT GTTGATAATT TGTATCATTA 4260

TTCTTAATTC ACAAAAAACG AACGTTTAGT ATTCTTCTTG CTAAGAAACT AAATTTGTTC 4320 

GTTTTTTTAC TCTTGTAAAT CTATTTTTGT TAGAGTTGAT TTGGTTTACA TCCGTACTTA 4380

AATTGATTTG TTAGAGCTCT ACTTTTATTA AAAAAATTCA ATTTCAAGGA TAAATAAGCA 4440 

GTATTCTAAA GGTACTTTTA GATGAAATAA AAGCCTTTAC ATGGTATAAT AGAGGTAGCT 4500 

CTTTAATGGA GGTGTTTGAG TGGAAAATCT GAAGAAAATG GCAGGTATCA CGGCTGCTGA 4560

ATTTATCAAG GATGGGATGG TTGTAGGGCT AGGAACAGGT TCTACTGCCT ATTATTTTGT 4620

CGAAGAAATC GGTCGTCGAA TCAAGGAAGA AGGCTTGCAG ATTACAGCTG TGACGACTTC 4680

TAGTGTGACC AGTAAACAGG CTGAAGGGCT CAATATCCCG CTCAAGTCTA TTGACCAAGT 4740

AGACTTTGTC GATGTGACAG TCGACGGGGC GGATGAAGTG GATAGTCAGT TTAATGGAAT 4800 

CAAAGGCGGT GGTGGTGCCC TTCTCATGGA AAAGGTGGTC GCAACACCAT CAAAAGAATA 4860

CATTTGGGTG GTGGATGAAA GCAAGCTGGT CGAAAAACTA GGTGCTTTTA AATTGCCAGT 4920

AGAAGTGGTT CAGTATGGTG CAGAGCAGGT CTTTCGTCAT TTTGAACGAG CTGGCTACAA 4980 

ACCAAGTTTC CGTGAAAAAG ACGGCCAACG TTTTGTGACC GATATGCAGA ATTTTATCAT 5040 

TGACCTCGCC TTGGATGTCA TTGAAAATCC AATTGCTTTT GGACAAGAAT TGGACCATGT 5100 

CGTTGGTGTT GTGGAGCATG GTTTATTCAA CCAAATGGTG GATAAGGTAA TCGTTGCTGG 5160 

ACGAGATGGA GTTCAGATTT CAACTTCAAA AAAAGGAAAA TAGAAGGGGG CATAAGATGT 5220

CTAAATTTAA TCGTATTCAT TTGGTGGTAC TGGATTCTGT AGGAATCGGT GCAGCACCAG 5280

ATGCTAATAA CTTTGTCAAT GCAGGGGTTC CAGATGGAGC TTCTGACACA CTGGGACACA 5340

TTTCAAAAAC AGTTGGTTTG AATGTCCCAA ACATGGCTAA AATAGGTCTT GGAAATATTC 5400 

CTCGTGAAAC TCCTCTTAAG ACTGTAGCAG CTGAAAGCAA TCCAACTGGA TATGCAACAA 5460

AATTAGAGGA AGTATCTCTT GGTAAGGATA CTATGACTGG ACACTGGGAA ATCATGGGAC 5520 

TCAACATTAC TGAGCCTTTC GATACTTTCT GGAACGGATT CCCAGAAGAA ATCCTGACAA 5580 

AAATCGAAGA ATTCTCAGGA CGCAAGGTTA TTCGTGAAGC CAACAAACCT TATTCAGGAA 5640 

CGGCTGTTAT CTATGATTTT GGACCACGTC AGATGGAAAC TGGAGAGTTG ATTATCTATA 5700 

CTTCAGCTGA CCCTGTTTTG CAGATTGCTG CCCACGAAGA CATTATTCCT TTGGATGAAT 5760

TGTACCGTAT CTGTGAATAC GCTCGTTCGA TTACCCTTGA GCGTCCTGCC CTTCTTGGTC 5820

GCATCATTGC TCGCCCTTAT GTAGGTGAAC CAGGTAACTT CACTCGTACG GCAAACCGTC 5880

GTGACTTGGC TGTATCTCCA TTTTTCCCAA CTGTTTTGGA TAAATTGAAT GAGGCTGGTA 5940

TCGATACTTA TGCTGTGGGT AAAATCAACG ATATCTTTAA CGGTGCTGGT ATCAACCATG 6000 

ACATGGGTCA CAACAAGTCA AATAGTCATG GAATTGATAC ACTATTGAAG ACTATGGGAC 6060 

TTGCTGAGTT TGAAAAAGGA TTCTCATTCA CAAACCTAGT TGACTTTGAT GCCCTTTACG 6120 

GCCATCGTCG TAATGCTCAC GGTTACCGTG ATTGCTTGCA TGAGTTTGAT GAACGCTTAC 6180 

CTGAAATTAT CGCAGCTATG AGAGAGAATG ACCTTCTCTT GATTACTGCG GACCATGGAA 6240 

ATGACCCAAC GTATGCAGGA ACGGATCACA CTCGGGAATA TATTCCATTG TTGGCCTATA 6300 

GCCCTGCCTT TAAAGGAAAT GGTCTCATTC CAGTAGGACA TTTTGCAGAT ATTTCAGCGA 6360 

CTGTTGCCGA TAACTTTGGT GTGGAAACTG CTATGATTGG GGAAAGTTTC TTAGATAAAT 6420

TGGTATAAGA TGACGCGCTA TGCTTTGCTG GTGAGAGGTA TCAATGTTGG TGGTAAGAAT 6480

AAGGTCGTCA TGGCGGAGCT TCGTCAAGAA TTGACAAACT TGGGACTGGA AAAGGTTGAG 6540

AGCTACATCA ATAGTGGCAA TATTTTCTTT ACTTCGATAG ATTCCAAAGC CCAATTGGTT 6600 

GAAAAGCTAG AGACTTTCTT TGCAGTCCAT TATCCATTTA TTCAGAGCTT TTCTTTACTG 6660

AGTCTAGAGG ACTTTGAGGC GGAACTTGAA AATCTACCAG CTTGGTGGAG CAGAGACTTG 6720 

GCACGAAAAG ATTTTCTCTT TTACACTGAG GGTTTGGATG TGGACCAAGT CATCGCGACA 6780 

GTTGAAAGTT TAGAGCTGAA AGATGAAGTG CTTTATTTTG GAAAACTTGG GATTTTCTGG 6840 

GGGAAATTTT CTGAAGAATC CTATTCTAAG ACTGCCTATC ATAAGTACTT GCTGAAGGTG 6900 

CCTTTCTACC GCCACATTAC TATTCGTAAT GCTAAAACCT TTGACAAAAT TGGTCAAATG 6960

CTAAAAAAAT AATAAAGGAG ACACACAATG ACATTTTTAA ACAAAATCCA TGAAACTGCT 7020

ACTTTCCTGA AAGAAAAGGG AATTGCAGCC CCTGAGTTCG GTCTAATCCT TGGATCAGGA 7080

CTTGGAGAAT TGGCAGAAGA AATCGAAAAT CCAGTTGTAG TAGACTATGC TGAGATTCCA 7140 

AACTGGGGCC GTTCAACAGT AGTCGGTCAT GCTGGTAAAT TGGTATATGG TGAACTGGCA 7200 

GGTCGCAAGG TCTTGGCTCT TCAAGGGCGT TTCCATTTCT ATGAAGGGAA TCCTCTGGAA 7260

GTGGTGACTT TCCCAGTTCG TGTGATGAAA GTTCTTGGAT GTGAAGGTGT TATTGTAACC 7320

AATGCAGCTG GCGGTATCGG ATTTGGTCCT GGTACCTTGA TGGCTATCTC AGACCATATC 7380

AACATGACGG GGCAAAATCC ATTGATGGGT GAAAACTTGG ATGACTTTGG CCCACGTTTC 7440 

CCAGATATGT CTAGGGCCTA CACACCAGAA TACCGTGCCA CTGCCCATGA AGTGGCTAAA 7500

AAACTTAATA TCAAGCTTGA TGAAGGTGTC TATATCGGAG TTACTGGTCC GACTTATGAA 7560

ACACCAGCAG AAATTCGTTC CTATAAGACA CTGGGAGCAG ATGCAGTTGG TATGTCTACG 7620 

GTTCCTGAAG TTATCGTGGC AGCCCACTCT GGCTTGAAAG TTCTGGGAAT TTCATGTATC 7680

ACTAACTTTG CGGCCGGTTT CCAAGAAGAA CTCAATCACG AAGAAGTTGT AGAAGTGACT 7740 

GAACGTGTTA AAGGTGATTT CAAAGGCTTG CTTAAAGCGA TTCTTGCTGA ATTGTAAGAA 7800

AAAAGATTTA AAAGGGGGAG TGCCTCTGTT TTTTCAGGAT TGACTGCCTA TCCGGATTAA 7860

AGAAGAAACA GAGGAATACT ATGAGCTTCT TCCTGCTCTT ATAACTGAAA GAAGCGGAAG 7920

AATAGGTATG TCTGATCTGA TAGCCAGCAT TGTGAAAGAC AAGATTCTAG GATACTAGCA 7980

TTAGCTTCCT AGCCAAGCAG ACTAGTATGA TAAGGAGAGA TGAGAATGAA TTGACTTTCT 8040 

GAATTTCTCA GTCTTATCAT ATATAGCACA ATGAGATTTC GCTTGAGTCT GCTTGTAAAT 8100 

AAACGAAAAG AAAGATAAGA AATAATGAAA ATTGGTCAAC GAATTATGCG CTTTGGCATA 8160 

AAAAATTAAG TATCGGAGTT GTATCTGTTG TAGTCGGCTT TGATTTCTAG CTCCAGCTGG 8220

AATTTCAGCC AATGAAGTAA AGCAAGATGT AACATCTGAA GTGGTAATAG GTGTGCTAGA 8280

TTCTAAGGAG GAATTGAAAG AGTCAGAAAA TGATGCTCCA AAACTAGAAA CTCCTCTTAG 8340 

AGAGGAGCCA AGACTAGCTC CTCAAACGCT TCCGGAAGCA AGTGAAGTTC TTGAAAACAA 8400

AAGGGAAGAG TCAAAAGTAG AGATAACATA ACCAGCTCAA GCGGATGATA TCCGCAAGGT 8460 

TGTTGGGGAA TTAGCCAAGG ATATAAGTAT TACTAAGTTG TATATGACAG GTCATTCTCT 8520 

TGGATGTTAC CTAGCTCAGA TTGCAGCGGT TGAAGCTTAC CAAAAATATC CTGATTTTTA 8580 

TAACCATGTA TTGAGGAAAG TGACAACTTT CAGTGCTCCT AAAGTGATTA CTTCCAGAAC 8640 

TGTTTGGAAT GCTAAGAATG GTTTCTGGGA TGTTGGTTTG GAAAGTCGTA AATTAGCTGT 8700 

TAGTGGAAAA ATTAAGCATT ATGTGGTTGA TAATGACAAT GTTGTGACTC CCTTGATTCA 8760

TAATAATCGT GATATTGTTA CATTTACAGG TAATTCACGC TTTAAACACC GTTCTCGTGG 8820

CTATTTTGAA AGTCCAATGA ATGATATTCC TAACTTTAAT ATTGGTAAAC AAGCTACCTT 8880 

GGATAAACAT GGTTATCGTG ATCCGAAATT GGATAAAGTG CGATTCTTTA AGAAACAGGC 8940 

TCTGCCTCGA TCTTCTAGTC AACCAAGCGC TGAACCAATG GAAAATATTG CCTCAGGAAA 9000

ACAGGTTACT CAAAGTTCGA CAGCTTTCGG AGGAGATGCT AGAAGAGCTG TGGATGGCAA 9060 

AGTCGATGGT AACTATGGTC ACAATTCTGT CACTCATACA AACTTCCAAT CTAAGCCTTG 9120

GTGGCAAGTA GATTTGGCTA AAGAAGAAAC CATTCGCCAA ATCAATATTT ACAACCGAAC 9180 

AGACACTGCC CAGGATAGAT TGGCAAACTT TGATGTCATT CTTTTAGACA GTTCTGGTAA 9240

AGAAATTGAG TGAAAACGTA TAACATCTCC TAAAGATGTG TCAGCACAAA TTACGATTAA 9300

CCATAAAAAA GCGCGCTATG TTCGGATTGA GCTAGAAGGC TATAATGCCC TCAGTCTTGC 9360

AGAAGTTGAA GTTTTCTGCT TTATAGCTAC GAATGCTGAA ACGGCGACAC AAGTTTCTAA 9420 

GCCAGTTCAA CCAATCAGTC AGACTCCTGT GAAGGATAAA ACATTGACAA TTCAACACAG 9480

TGGAGCTTAC ATTGCCCGCT ACTCCATAAC TTGGGAAGAA GTTCCAGTAG ATAAAGATGG 9540 

AAACCAAGTT GTTCGTAGTC ATTCTTGGGA AGGAAGCGGT CGCAACCAGA CTGCAGGTTT 9600

TGTCCTCAAC CTCCCAATCA AAGAAAATAT GAGAAATCTG CGAGTTAAGA TTGAGAAAAA 9660 

GACGGGCCTA CTATGGAATA GATGGCAAAC AATCTATGAA AACAGACCAA TTTTAGCTCA 9720

ACCCCACCGT AAAATTACCC ATTGGGGTAC GACATTGAAT TCCAAGGTGA GTGACGATGA 9780 

TGTCTTGTAA TCTGATGGTA GAATGACAGT TAGTTTGTCT AGTTTATAAG AAAGTACTAC 9840 

CTGAGCTTGA ATAGGACTCA GGTAGCTCTC TATGAAAGAA CAAAATTAAT ACTCAATGAA 9900

AATCAAAGAG CAAACTAAGA AACTAGCCGC AGGTTGCTCA AAGCACTGCT TTGAGGTTGT 9960

AGATAAGACT GACGAAGTCA GTCACATATA TAATCCAAGG CGACGTTGAC GTGGTTTGAA 10020

GAGATTTTCG AAGAGTATAA ACAGAAAGGT AGAGCGCGTG TTCTAATTTG AACACGAGTA 10080 

GAAAACTTTT CTAAAAACAA AAACGAAAGG ATGGGTAAAC TGTATTCGCT GAACTGAATA 10140

CGGGCGACTC TCCTCTAAAT CAAAATTAAG AAAGGAATTG ACCCCACCCT AAAAGTAGTG 10200 

GGAAAAAGAT AGTTGATCTA GCGAGCATCG CTCACTGCGC CCAACTCCTA TTTTCCCTTC 10260 

GCTTTTTGAT GGGTTTGGTA TCTTTCTCAA TATAAAATAT AAAATAAAGA AAGGTAGAGC 10320 

GTGTGTTTTG ATTTGAACAC GAGCGGAAAA CTCGGAAAAT AGATAATCTG ACTGAAAAAT 10380 

CAGGATTTCT CGTCAGGTTC CTAATTTTCA GTCGTTTTCT TCTCGCTCTT TGTATCATAA 10440 

ATTATGTCTA TCCATATTGC TGCTCAGCAG GGTGAAATTG CTGATAAAAT TCTTCTTCCT 10500 

GGGGATCCTC TTCGTGCTAA GTTTATTGCG GAGAATTTCC TTGATGATGC TGTTTGTTTT 10560

AACGAAGTGC GTAACATGTT TGGTTACACT GGTACTTACA AGGGTCACTG TGTATCTGTC 10620 

ATGGGAACTG GGATGGGAAT GCCATCTATT TCGATTTATG CGCGTGAGTT AATCGTAGAC 10680 

TACGGTGTGA AGAAATTGAT TCGTGTGGGA ACTGCAGGTT CTTTGAATGA AGAGGTTCAT 10740 

GTTCGTGAAT TAGTTTTGGC GCAGGCGGCT GCAACCAACT CAAACATCGT TCGTAATGAC 10800 

TGGCCACAGT ACGATTTTCC ACAAATTGCT AGCTTTGATT TCCTTGATAA AGCCTACCAT 10860

ATCGCCAAAA AACTTGGTAT GACTACTCAC GTTGGGAACG TTTTGTCATC TGATGTCTTT 10920 

TACTCAAATT ACTTTGAAAA GAATATCGAG CTTGGTAAAT GGGGAGTCAA GGCTGTGGAA 10980 

ATGGAAGCAG CAGCTCTTTA CTATCTTGCT GCCCAATACC ATGTTGATGC GCTAGCTATC 11040 

ATGACCATCT CTGATAGCTT GGTCAATCCA GACGAAGACA CAACTGCAGA AGAACGTCAA 11100

AATACCTTCA CTGATATGAT GAAGGTTGGT TTGGAAACCT TGATTGCAGA ATAATTATAG 11160 

CCAAAAAGGG GCTCTTTGTC AACTGTAGTG GGTTGAAAAA AAGCTAAGCT TGAGAAAGGA 11220

CAAATTTCGT CCTTTCTTTT TTGATATTCA GGGCGATAAA AATCCGTTTT TTGAAGTTTT 11280 

CAAAGTTCCG AAAACCAAAG GCATTGCGCT TGATAAGTTT GATGAGATTA TTGGTCGCTT 11340 

CCAGTTTGGC ATTAGAATAG TGTAGTTGAA GGGCGTTGAC GATTTTCTCT TTGTTCTTTA 11400 

GAAAGGTTTT AAAGACAGTC TGAAAAAGAG GATGAACCTG CTTCAGATTG TCCTCAATGA 11460 

GTCCGAAAAA TTTCTCAGGG TCTTTGTTCT GAAAGTGAAA AAGTAAGAGT TGATAGATCT 11520 

GATAGTGGTG TTTCAAGTCT TCTGAATAGC TTAAAATCTT GTCAAGAATT TCTTTATTTG 11580

TTAAGTGCAT GCGAAAAGTA GGGCGATAAA AACGTTTATC GCTsArTTTA CGACTATCCT 11640 

GTTGGATGAG TTTCCAGTAA CGCTTGATAG CCTTGTATTC ATGAGATTTT CGTTCAAACT 11700 

GATTCATAAT TTGAACACGA AAACGACTCA TGGCACGGCT GAGATGTTGG ATAATATGGA 11760

AACGATCTAG AACGATTTTA GCACACGGAA AAAGCTGTTT AGCCAAGTCA TAGTAAGGAC 11820

TAAACATATC CATCGTAATG ATTTTCACTT GACAACGAAC GGCTCTATCG TAGCGAAGAA 11880

AGTGATTTCG GATGACAGCT TGTGTTCTGC CTTCAAGAAC AGTGATAATA TTAAGATTAT 11940

CAAAATCTTG CGCAATGAAA CTCATCTTTC CCTTAGTGAA GGCATACTCA TCCCAAGACA 12000

TAATCTTTGG AAGCCGAGAA AAATCATGCT CAAAGTGAAA GTCATTGAGC TTGCGAATGA 12060

CAGTTGAAGT TGAAATGGCC AGCTGATGGG CAATATCAGT CATAGAAATT TTTTCAATTA 12120

ACTTTTGAGC AATTTTTTGG TTGATGATAC GAGGGATTTG GTGATTTTTC TTTACCAGGG 12180

GAGTCTCAGC AACCATCATT TTTGAAsAGT GATAGCACTT GAAACGGCGT TTTCTAAGGA 12240

GAATTCTAGA AGGCATACCA GTTGTTTCGA GGTAAGGGAT CTTAGACGGT TTTTGAAAGT 12300

CATrTTTCTT CATTAGACTT CCACAATCAG GGCAAGATGG AGCCTCATAA TCCAGCTTAG 12360

CGATAATTTC TTTGTGGGTA TCCATATTGA TGATATCTAG AATCTTGATG TTTGGGTCTT 12420

TAATATCGAG CAGTTTTGTG ATAAAATGTA ATTGTTCCAT ATGATTCTTT CTAATGAGTT 12480

GTTTTGTCGC TTTTCATTAT AGGTCATATG GGACTTTTTT TCTACACAAA AATAGGCTCC 12540

ATAATATCTA TAGTGGATTT ACCCACTACA AATATTATAG AGCCCAAAAA GGAAGCCCTT 12600

TATGAATTGT AGGACTTCCT TTTCTTATCC AGAAATTGAT CTAGCTCTCT CTGATTTCGA 12660

AGAATAGTGA CTTTATGTGA ATATTCTTGG CAAAGTTTTT GGTAATTTTC TTTTTGAGTT 12720

TTGCGGACGC CCATCCCAAA GAATCCATCT GATAAACTCC CACTCAAAGC GTTCAGGGCA 12780

ATCTACCGCC ATACTTTCTC TGACTTTTCC ACGGTATTTA AGATAACGCT TAAAGGCTCT 12840

AAAGAGACAG GTCAATGGCG AAAAATTGAG AAAGATGATT TGGTCAGCTT CTTGCATTCG 12900

TTCTTGGTAG TAGCACCAAG AATAATTACC ATCGATGACC CAAGCTTTAT GCTTGGTGAG 12960

AAAGTTTTTT ATCTCGGTTA ACATCCATTC GCAGTCACTG TCTTGCCAAC CAGGTTGAAA 13020

TTGGAGTGTG TCCATGTGCA GTTTTGGAAT GGAGTAGTAG TTAGATAACT TTTCTGCTAT 13080

AGTTGACTTA CCAGAACCAG AATATCCGAT AATTGCGATT TTCATTTTCT ACCT7TTCCT 13140

ATTTGGAGAC AAAAAAACAG CCTCTATGGA CTGTTTCTTA TTTAACAAGT TTAGCTGAAA 13200

GACGAGCTTT ATCGCGGCTT GCTTTGTTTT TGTGAATCAA ACCTTTAGTT TCTGCTTTAT 13260

CGATAGCTGA GCTAGCAGCA CGGAAAAGTT CTTCAGATGG GTTTGCTTCG AAAGCTTTTA 13320

TAGCAGTACG CATAGCTGAT TTTTGAGCTG AGTTCTTTTC GATTCGTCTA ACGTTCAATT 13380

CAGCGCGTTT GATAGCTGAT TTAATGTTTG CCAATGGTCT TACCTCCATA TTTACTAACT 13440

(2) INFORMATION FOR SEQ ID NO: 129:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8512 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129:

CCTTTTTTCA AAAACTAGAT ACTAGTCTAT CAAAAGTAGG AAAGGGTTTC AAGAAAATTG 60 

ATTGGAAATT TTTTGAAAAT CATAGAACTA TTAGCTAATC CCTAGTATTG AAAAGACTGG 120 

ATAGCTTCTT TCAGGTCATC TTGTAAACTA TTTCTCTGGT CAAGTTGGAC ATAGACTTCC 180 

ACCAGACAGG ATCTAAAGTT GGAAAATTTG TAAAAATCCT CCCTTTCTTC TATCGGAAAA 240 

TCAACAGTTT TTATCCAAGA AGCTACTTGT TCTTGCTCCA ACTTCCCTTG TAAAATAGGT 300 

TCATAGATCA CTCTTGCTAA ACGCCAATCC TCATCATCTG TAAAGCGAAT CGACATTCTT 360 

TTAAATAGTT GGCCAAGTAT ATCAAATACT TCATGAACTC TGTTTTTAGG AAAGTCTGGA 420 

TGACAAACCA CCTCTGTCAG TAAATCGGCT CCATGTGCAA AAGCGTGAAC CCAACCATAC 480 

TGACTTGAGA AACCCCTTGT ATCCTTTTCT TTTGAAAGAT AGTGCAAGCC TTGATTTAAA 540 

AGGACATTAC GAATTTCTGG AGAAGGATTT CCCAAATGAT CAAACAACCA CTGGATTTCT 600 

TCCTGGTTAT AATTTGGTTT TTCTTCTGCT ATTTTTCTTA GTAAATCTTG ATACATGGTC 660 

AATACCTCTA CATTTCTAGC AACTGTTCAA AAAGGCAGTC TTAAATGACT CAATATTGAA 720 

TTCTCAATTA AATACAATCT GATATAAAAT GACGTAAATA ACTATCAATA CCAGTTCTAC 780 

AGTAAGTTCA AATTTAACAT CACGACCTTC AACGACATTT TTGAAAATAG CTACAACTAA 840 

GACAAATAGA ATGACGCTTA ACAAGCCCAT AAACATCATT CTAAAAAATT TTTCTATTCC 900 

CCTACTCTCC CAACTCAGCA CTATAGGAUA TAATCTGGTC AACTGTGTCA GACAAGAATT 960

GGATGGTATC ACGGAGTGGT TTGTCTGTTG AAATATCAGC ACCGATAATC ATGGCTGACT 1020 

CAAGTGGTGT CTTGCTACCA CCTGATTTGA GGAGATTGAG CCAGTCTTCA GCTCCAGTTT 1080 

CAGAATGTTT TAGATGAAGG TAACCAGCAG TCGAGATAAC TAGTCCTGCT GAGTAAGTGT 1140 

AACTATACAA GCCCATATAG TAGTGAGCTT GGCGCATCCA AGTCAGAGTT GCATCATCGT 1200

CAATTTCAAT AGCATCTCCC CAGAAATCCG TCAAAACTTC CTTCATAATG CTGTTGAGCT 1260

TGCTTGCTCC AAAGGTCTCC CCTTCTTCAA TCAATGTATA CACCTTACGC TGGAAGGCGG 1320 

CTTCCAAGAG GTGGGTGATA AAGTTATGGA AGTAGGTGTC TGTCAAGCGA TGAGCCAGAG 1380

CGAAGCGTTT TTGACGTGGG TCATTAGACT GGTTCTCCAA GTAATCACTG AGTAGCAATT 1440 

CATTGAAGGT TGACGGTGCT TCAACATAGT AGGTCGACAT ATGGGCATTG AAGTAACTTT 1500

GATGATTGTC TGAAAAGATG AATTGACCAG AATGCCCGAT TTCATGAATC AAGGTATAGA 1560

CATCGCTCAA ACGGCCTGTC CAGCTCATGA GTACATAAGG GTGTACGCGA TATGGGTCCG 1620 

CCGCATAACC ACCGGAATCC TTGCCACTGT TAGCAGCAAA GTCCACCCAG CGCTCTTCTT 1680 

GGTAACGAGC AACTTCCTGA CAATATTCTT GCCCCAAAGG TTCTACCGAC TTCATGACCA 1740 

AATCATAGGC ATCGTCAATA GTCACTTCAG GATTCAGGGC GCTGTCCAAG TCCAATTTCC 1800 

AGTCTGCAAA GGTCATCTTT TCAAGACCAT TTACCTTGGC AACATGCTTG AGGTATCTCT 1860 

GAGCGACTGG TGCAAAGTCC TTCATGATGA GGTCAATCTG GCGGTCAAAC ATGACACGGT 1920 

CCACTTCTTG TTCAGCTAGA AGATAGTCAA AGACAGAGTC GTATCCCTTC ATATCAGCCA 1980 

AGAGTTTTTC AGACTTGACC TGAGCCAGAT AGGCTGCTGC AGCCGTATTT TGGTGCTTAC 2040 

GAAGTCCCTC TGAGAAGGAA CGGAAGGATT TCTCACGAAC CTCAGCATCC TCATGGTTTT 2100 

GGTAGAAATT CTCATAGGTC ACAAAGCTGT TTTTGTAGGT CTTGCCATGG GCTTCAAAGT 2160 

CAGCCATTTC AAAATCCCCA GCTCGCATCT TAGTATAAAT GTCCTGCGGA CTGTAGAAAA 2220 

CTTCACCGAG ATTTGTCAAG GCCTTCTCCA CATCTGCCCC TAAGTAGTGG GCTTTTTTGA 2280

TTTTAGCCTG ACGAATGGCA GCTGTTAAAT GTGGCAATTT ACCCAAACGG TCCAAGACTT 2340

CCTCATCTGC TGCCACCAAG GCATCGTCAA AGAAGGTCAA GGCTACGCTG GCATCTGTTT 2400 

CAAATTCCAT CCCAGCTTGG GCAATATTGG CAAATTCGTC ATTGCTATAG TCCGTCGTCT 2460

GAGGCATAAA ACCATAGTTG CCAATATGGC TCATCTGAAT GTAGATCTGT TCCAATTCCG 2520 

CAAAGGCCTT CTCGAAATCC TCAAAAGTGT GAAGATTGCC CTTGTAATCA CGGCTAAACT 2580

GGTTGATGTC TTCGCGAGCT TTCTCGATTG CACGCAAGAA ATCCTCACGG TCTTGGTATA 2640

GGGCTGTTAA GTCCCAGAGT TCCTTCTCTG GAAATTCTGA ACGGTGTTTT TGTTCCATTT 2700

TCTTCCTCTT ATTTCTCTAA TTCTACTAAA ACACTAAGGG CTGATAAAGC GTAAAGCGGT 2760

GCTGTTTCTG CTCGCAAAAT ACGAGGACCT AGGCCTGCCA AAACGGCTCC TTTAGCTTCA 2820

AAACTTTCGA TTTCTGCAGG TGAGAGACCG CCTTCTGGAC CAAAGATAAA GAGCAGTTTG 2880 

GCTCCTGTTT CAAGACCAGT GACTGCTTGC AGAAGCGCAG CGGCTTCTCC TTCTTTAGCT 2940 

GATTCTTCAT AGGCTACTAT GATAGAGTCA AACTGGTCCA GCTGAGCTAG AAAATCTGCT 3000 

TTTTTCTCGA AAAGTTTAAT ACTTGGTACA ATATTACGCT TGCTTTGCTC GGCTGCTCCA 3060 

AGGGCAATTT TTTCTAGTTT TTCAACTTTT TTACCCAATT TCTTGCCATC CCACTTGGCA 3120

ACTGACCAGT CTGCAGGAAA GGCCCAGATT TGGCTAGCCC CCAGTTCGGT TACTTTTTGA 3180

GCGATGAACT CCAGCTTGTC TCCCTTGGGA AATCCAGATG CGATGGTCAC TTGGACTGGT 3240

AGTTCCACAT TGTCATTTAA TTCTTGGACC AACTCAAACT GACGATTTTC CATATCCAGC 3300

ACGCGCGCCA AGCGCTTGAT GCCATCATCA AAGACTAAGG TAACCTCATC CTCTTCTTTC 3360

AAGCGCATAA CCTGAAACAT ATGCTTACTG GTTTCCTTGT CCTCGATAGT GACAGGAGAG 3420

ATAGCACTGC CTTTTACAAA ATACTGCTGC ATGCTAGCCT CCAATCACAC CAGAGATATC 3480 

CTTGGTTTTC TTAAAGACAC AGGTATTCCA TTCCCCTTGA ACCATGTGAG TTTCGAGGAA 3540 

AAATCCAGCT GACTCAGCCG ACTGGCGCAC CATGTCCAAC TTGTCCTTGA TAATGCCACT 3600

CATGATCAGG TAGCCTTCAT CCTTTACCAA GCGATAAGCA TCGTCTATTA GATGAATGAG 3660

GATATCCGCC AAGATATTAG CCACAATCAC ATCTGCCTCA ATTTCCACAC CCTTAAGCAA 3720

ATCTCCAGCC GCTACATGGA TATTTTCCAT GCCAGGGTTG AGCTCAATAT TTTCCTGAGC 3780

CACACGAACC GCCACATCAT CCAGGTCATA GGCGAAAATT TCTTTAGCCC CCAGAAGCGA 3840

GCTGGCAATA GAGAGAACCC CTGAACCAGT CCCCACATCT AGCACCGTTT CGCCACCACG 3900

AAGAACCTGT TCCAAGGCAA AAAGGCTCAT CTTGGTAGTT GGGTGGGTTC CAGTACCAAA 3960

AGCCATGCCA GGATCCAGCT TGATAATCAT TTCCCCCGCA GTCGCCTCAT AGTCTGTCCA 4020

AGAGGGAACG ATGGTCAAAT CATGAGTGAT ACGAGCAGGT TCATAGTATT TCTTCCAGTT 4080

GTCTGCCCAG TCTTCCTCAG CCAAGGCAGT CGTACCTATT TTTAACTCTC CCAAATCCAT 4140 

AAAATCTGTC AATTCTGCTA GACGAGCCTG CAAATCCGCC TCAACCACTG TCACATCCAC 4200

CGTGTCAGGG TAGTAGGCTG TCACTACGAT TTCTTCTTGC TGCTCCACCT CTGGGAAAAT 4260

CTCTCCAAAG CGGTCCACAT TTCCCACATA GTCCATACTG TCTTCGATTG CGACTCCTTG 4320 

CGCTCCCAGC TCAATCAAGA GATTGGAAAC CAACTCCTCT CCCTCACGCT TCACTGTAAC 4380

TTTTAACTCT TGCCATGTTT CCATTATTAA TACCAAGCCC GTAAAACACA AAACCAAAAT 4440

AGGAAATTCT CTGAAGACGC TTGTGTCTAA GAGAAGTTTA TCTTTTTGGC ACAGTGTTTA 4500

GGGCGGGTTC AGTTTAGAAA TGTAACTGAA CCATCCTTTC TAATCACTTA CTTTTAAATA 4560 

ATCTTTTAAT CTCTCTTGCA ACTGAGGCAC AACTTGACTG GAACTAAGAA ATTCCTCAAC 4620

ATTCATCAGC TGATAGCCCT GTCCTTCATC TCCGAAGATG ATATTGTCAA ATTGTTCTTG 4680

TCTTAGCTGA CCAACCATAA AGACCGATTT CTTGCCTTTA AAAATTACGC TAGGATAAAT 4740

CTTGCTCCAA AGCAGACAGT CTTCATCTAA ATGAATTCCC AGTTCCTCAT AAACTTCACG 4800

CCGAGCGCAT TCAAAAGGGC TTTCGTCCCC TTCACGGCCA CCACCTGGCA GTTCCCACAT 4860

ATTGGCCCAG GGAATACTTG CCTTATCATC GCGTAAGATA GTCAAAAGCT TATCCCCACA 4920 

AAACAAAGCA ATCTTGCAAC CTGTGAAATC AGAAATTTCT AGTTCCATCT TCAGTTCCTT 4980 

CTAACATTTC CTTTTCCAGC TCGGCTAACC AGTTTTCATA ATATCTTTTC TCATCCCTCA 5040

ACATTCGACT ACTATCCATT TTCTGTCTAG CAATCTTGAG AGCCTTACGA GTTCGATCTA 5100

CATCTTTCTT CACCTTTAAT TGATACCAGG CTTGTATCAC TTGAAGATTG GACAGTTTGA 5160

GAGACAGAAA CGATTTGACC TGTCGAATAC TAGCATATTG CTCCGCTTGC TCAAAATCTC 5220

CTTCCAACAA GGCGATATGA AGCAGGGATA GTTGGGCAAC TGTCTGCATC ATCGGAGTAG 5280

TTGTCCTCTC AAGTAATGCT TGAAACTGCT GTTTAGCTAC TTCTTCCTTC CCTTCCAAAA 5340

TGGAAACTTC ACCTTGCATA CCTAATACAC CATCCGCAAA ACTCCCTCGT GCATCCTCAG 5400

GAACTGCTTG AACAAAGTCT TTCAAATCAT ATTCTTGAGG AGCTAGCAAG GTCTGGGCAG 5460

AATGTCTCAA TACCAGGTAG GCGTATTTGG TATTTTCAGG GTGTTGTAGT AATTCCCAAA 5520

TTTTTGCTCC ATCGGTGATG TCGACTGGCA AAATGTTATT TAGGAAGAAA GATAAATTAA 5580

GAAAAATCCA AGTCCCTGCA AAATACCAGC TTCTTGTCAA AAATCCAAAC AATATCGCCA 5640

ATAATATCAA GCCGAGATGA ACCATCAAGC CTCCTGAAAG CATCAGGATG ATTCTTTGAT 5700

CGCTTTCATC CTCTTTTAAA CCAATGTATT GAGCACCAAC ATTTTTCAGA ATGGCTGTTC 5760

TACTAAGATG AAACCTGCCT GACTTTTTGG TCAAAATAAA ATGTCCTAAT CCAAAAGCCA 5820

CCAGCCGATA GCCTGTCAAG TAGCCACAAA AAGCATGACC CAGCTCATGA AGAATAAAGA 5880

TTAAATACAT GCTTAGAAGA GCGAAGGCAT AACCAAAAGT AAAGGCTAAA ACTGCGGAAT 5940

ACCCCAACTC TGCAAATGCG ATTGTTCCAC AAGCAAAAGC TAGCATAATA AAGACAACAG 6000

CTAGCACATA AACCAAATAA GTCCCAATTT TCTTCATAAC ACCTCCAACC AACTCCTAGT 6060

ATCTTGGATA AGGATAAAAT TCTCCCTTTT CCAAGCCAAT TTTTCCTTCT TCAAAGACTT 6120

CTTGGTTCCA TTCCATGACA AATTCCTCTG CTTCTGGGTC TTCCAAAAAG TCCATGAGGA 6180

CATCTAGCCC AACCTCAGCA GTATCTTTAA GGAAAAGCGC AAAATAAGCT AAAAATTCAC 6240

GGGAAAATCC TTTTTTAGGC AGGTAAGGAA TAACAGTCAA ATAGTCTTCC TCATTGACTG 6300

TTGACTTGGC AGGATTGTAG AAAAGGACCG CTTCCTCAAA AAGAATGTCA TCTGATGAAA 6360

CCTCTCCGTC TTCATCCACC ATCTCCACAC CGCAGCATTT TGCGCTTCCA ATAGAAAACT 6420

CACTTCTACC GCATGGTTGC GTTTGTCCCA GCTAATCTCA AAGTCAAAGG GAAAGTTCTT 6480

GTCCAACTCT TCCTCTAAAA TATCTAAAAA TCCGTATGTT GCCATTTTGT CCTCTTTCTA 6540

TGCGACTCTT TAATCGCCCC GATTGCTCGG AAATATGCTA AAATAGATAC TACCATCTTA 6600

CCACAAAATT ATTTTATGTC CTAATTATAC CATATTACCT CATTTAAACC CTTGGTATCA 6660

GTGATTTTCT TAAAAGTCTG ATTTCTTCAT TTCTCATAAA AATCAATATA AAAAGCCCTC 6720

GAAAGGGCTA ATAAATCTAT AAAATCAATA GGCGAGTAAC TAGCACAAGT GGACGTGCTT 6780

TTTTATTGAC TATTACCACG ATACCACGCT TAATCTTAGG CTTGAACTTT CTTATCTGCA 6840

ATAGCGTCTG TCAAAGTCTG AGAAAAGTTA AGCCCCATTT CTCGTCCCAA CTTATCTGCC 6900 

CATTTTGGTA TGGTCAAAGT CTTTTTAATG GGTTCCTGAC TTCCTAGGTA TTCTGATACA 6960

TCAACAGATA CCATAGAAAT AAAAGATTTA TCAAGGTCAT AGGTTGACAC GAAATCTTCA 7020

TCATCTTTAA AAGGATCATT ATCAATTAAA GACAAGCTAT TGATATCTGA TGGCTGAGGT 7080 

AACTCTCCAT CACTCTCTAT CAAATCTGCA ACAGTTATCC CTAGCCACTC CGACCCCATA 7140 

GCCAAAGCCT CAGAAATCCC CTCTCCTTGT GTAGCTGAGT ATTCAAAATC TGGGAAATGG 7200 

ACAAAATAAG TCGGTTCTGT TCCGTCTGTG TCGTCATAAT AAAATAAAGC TGGATACGTA 7260

ACTAACATTT CACTACCTCC ATATCAAAAA GCAGGGACTG AATTTTACAA CCCAGCTTGC 7320

TTTCTTATCC CTCTTTCAGT GTACTTATTC AGCTCACCAT GAAGGATTGT GATAGGTCTT 7380

TCCCCTTGCT TTTCCATTTT AATATGGGAG CCTTTACCGC CTCTAGTCTT TATCCAACCA 7440 

TGGGCCGTAA GGAGTTTAAC CATCTCTTTT TGTGTCATAG GCATAGCGCT TTTACCTCCT 7500 

GACAACACCA TTATAACACG TGTTACACGT ATTGTAAAGG AGTGATACTT ATTATTCTAT 7560

TATACATAAA AGCCCCTAGA TGTGGTTCTA AGGGAAGCCA ATTTATTCAT ACCTATTTTT 7620

CTAATGAGTA GTAAAAACTG CTTCTTTATC GAGCAATTCA TCATCTGTAT AGTCAATTGT 7680

AAAAGTATCT CGATCTAAGA CAGATTGAGG CGGAGTTGAA TGAATCATAG GAACACTGCG 7740 

TACTCTATAT TTTTTATCTC CAATTTTTAC AAACTGATAC TCTTCGAAAA TCAAATTCAA 7800 

ACCACGTCAA CGTCGCCTTA CCGTACTCAA GTACAGcCTG CGGCTAGTTT CCTAGTTTGC 7860 

TCTTTGATTT TCATTGAGTA TGATTAACTC TCAAGTCTTC GAAATCAGGA TTTTCAACAG 7920

TTATTACAAG GAGGCGATTT ACTACTTCAA AAACATCAAT TATTCTATTT TTCATATTTT 7980

TTCAACCCAT TATTAGAATG AACTTCTTGG TAAGCAAAAT CAAGTTTAGA TTTAATGTTT 8040

TCGTACAAAT CTAAAATCTC TTTTGGAGTA TCTTCCCGGA AGAAAAGTTT TCTTTTCCCT 8100 

GAAATAACTT GATCACTAAG AATCCAATGA CGAATTTGTT TTGTAAAAAT CAAAATTTCC 8160 

TGACTTGGTA GTTCCATCAT TTCCATTGCT TATCACCTCT CTTTTCATTA TAGTTCATAC 8220

AATGACATTC AGCAATATTA TTTCTCAAGT CAGCACTTCC ACTTCTTTAG GCTCAACTAT 8280

CCTATTTTGA GCTTTAAGGA AAATCAAATC TCTCATGCTG ATACCTCTCC TCATTAAATT 8340 

AAATAGTAAA AAAGATTCTA TCTCACTCCC TGATTATTAC AAAACCATTG AAATATCACA 8400 

ACTAATAGGC TAGAATGGAC ATAGTAAGAT ATAGTAGATG AGTCATTCTA CTCAAATCCA 8460 

CGTTAGAAAG GACTGCTATG CCAGACAATC TCGCCGTTCG CATGCGCCCn GG 8512 
(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2869 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 130: 

CTCGTTTCAA GGTTGAGTCT CTTGCAAATC TTGTTCGCGT TCTTCCTTTT GCCAAGGCAT 60 

CTCTCCCATG GTTGGTGCcA GCCATTGTTG GAATCTTGCT CTCATTGGTT CTACCAAACA 120

AGCAAGAAAG CGATGTTTTT GAAATGGAAT AATCACTTAA ATCACTTTTG TAGCCAAGTC 180 

TACAGGAGTG ATTkTCTTTT TTTATCCGAT GATAAATGTG TTATAATAGG TAGCGAAAGA 240 

GGTGAAGAAA TGAATCAAAC AGTAGAATAT ATCAAAGAAC TGACAGCCAT TGCGtCGCCA 300 

ACAGGCTTTA CTCGTGAGAT TGCGGACTAT TTAGTCAAGA CTCTAGAAGG TTTTGGTTAC 360 

CAGCCGGTTC GCACATCCAA GGGCGGTGTC AATGTAACTA TTAAAGGTCA AAATGATGAG 420 

CAACATCGCT ATGTGACTGC CCATGTAGAT ACGCTTGGTG CTATTGTCCG TGCTGTCAAA 480 

CCAGACGGCC GTCTCAAAAT GGACCGTATC GGTGGCTTTC CTTGGAACAT GATTGAAGGA 540 

GAAAACTGTA CCATTCATGT GGCTAGCACA GGTGAAAAAG TATCAGGAAC CATCCTCATC 600 

CACCAAACTT CTTGCCATGT CTATAAGGAT GCAGGAACTG CAGAACGCAC GCAAGACAAT 660 

ATGGAAGTGC GTTTGGACGC CAAAGTAACT AGTGAAAAAG AAACTCGTGC TCTTGGCATT 720 

GAGGTCGGTG ATTTTATCAG TTTTGACCCA CGAACTGTCG TGACAGAGAC AGGTTTTATC 780 

AAGTCTCGCC ATTTGGATGA CAAGGTCAGT GCGGCGATTT TGCTCAATCT CCTTCGCATT 840 

TATAAGGAAG AGAAGATTGA ATTGCCCGTA ACAACTCATT TTGCTTTTTC AGTCTTTGAA 900 

GAAGTGGGAC ACGGTGCAAA CTCTAACATT CCTGCTCAGG TAGTAGAATA TCTGGCTGTG 960 

GATATGGGAG CCATGGGAGA TGACCAGCAA ACAGACGAAT ATACAGTGTC TATCTGTGTC 1020 

AAGGATGCTT CTGGACCTTA TCACTATGAC TTCCGTCAAC ATTTGGTGGC TTTGGCGAAA 1080 

GAGCAAGATA TTCCATTTAA GCTGGATATC TATCCATTTT ATGGTTCGGA CGCTTCAGCG 1140 

GCTATGTCTG CAGGGGCAGA AGTCAAACAC GCCCTTCTCG GTGCTGGTAT AGAGTCTAGC 1200 

CATTCCTATG AGCGTACCCA TATTGACTCG GTGATCGCAA CAGAACGAAT GGTCGATGCT 1260

TATCTTAAGA GCACGTTGGT GGACTAATAT GTGCCTTATT TGTCAGAGAA TTGACCTCAT 1320 

CAAGAAGGAA GAAAATCCTT ACTTTGTCAA AGAGTTGGAA ACAGGCTATC TTGTGGTTGG 1380

AGACCACCAG TATTTTGAAG GCTATAGTCT CTTTCTAGCC AAGGAGCATG TCAGCGAATT 1440

GCACCATTTG AAAAAGGAGA CAAGACTCCG TTTTCTAGAA GAAATGAGTT TAGTCCAAGA 1500

GGCAGTTGCC AAGGCCTTTG CTGCTGAGAA AATGAATATC GAACTGCTAG GAAATGGCGA 1560

TGCTCATCTT CATTGGCATC TGTTTCCACG ACGGACAGGT GATATGAATG GTCATGGTCT 1620

CAAGGGTCGT GGACCAGTCT GGTGGGTTCC CTTTGAAGAA ATGACAGCAG AAACCTGCCA 1680

AGCAAAACCG GATGAGATTA AAAGATTAGT CAAACGTTTA TCGTCAGAAG TAGATAAACT 1740

ATTAGAAATA AAGGAGTAGA AATGAAGAAA AGATACCTAG TCTTGACAGC TTTGCTAGCC 1800 

TTGAGTCTAG CAGCTTGTTC ACAAGAAAAA ACAAAAAATG AAGATGGAGA AACTAAGACA 1860

GAACAGACAG CCAAAGCTGA TGGAACAGTC GGTAGTAAGT CTCAAGGAGC TGCCCAGAAG 1920

AAAGCAGAAG TGGTCAATAA AGGTGATTAC TACAGCATTC AAGGGAAATA CGATGAAATC 1980 

ATCGTAGCCA ACAAACACTA TCCATTGTCT AAAGACTATA ATCCAGGGGA AAATCCAACA 2040 

GCCAAGGCAG AGTTGGTCAA ACTCATCAAA GCGATGCAAG AGGCAGGTTT CCCTATTAGT 2100 

GATCATTACA GTGGTTTTAG AAGTTATGAA ACTCAGACCA AGCTCTATCA AGATTATGTC 2160 

AACCAAGATG GAAAGGCAGC AGCTGACCGT TACTCTGCCC GTCCTGGCTA TAGCGAACAC 2220

CAGACAGGCT TGGCCTTTGA TGTGATTGGG ACTGATGGTG ATTTGGTGAC AGAAGAAAAA 2280

GCAGCCCAAT GGCTCTTGGA TCATGCAGCT GATTATGGCT TTGTTGTCCG TTATCTCAAA 2340

GGCAAGGAAA AGGAAACAGG CTATATGGCT GAAGAATGGC ACCTGCGTTA TGTAGGAAAA 2400 

GAAGCTAAAG AAATTGCTGC AAGTGGTCTC AGTTTGGAAG AATACTATGG CTTTGAAGGC 2460 

GGAGACTACG TCGATTAATA CTCTTCGAAA ATCTCTTCAA ACCACGTCAG CGTCGCCTTA 2520

CCTACTGACT GCGTCGGTTC TATTCACAAC CTCAAAACAG TGTTTTGAGT cGATTCGTCA 2580 

GTTTTATCTG CAACCTCAAA GCTGTACTTT GAGCAstGCG GCTAGCTTCC TAGTTTGCTC 2640

TTTGATTTTC ATTGAGTACA AAAAGTAAAC TTTTCTCTTG CAATTCCAGA TAAATAGTGT 2700 

ATAATGGATG GGTATGTGAA AAACATACTT GTGGGAGGTA AAAATCTCTA ATTACCGCCA 2760

AAACCACAAA GGAGGATTTA AAAATGGCTA AAAAAGTCGA AAAACTTGTA AAATTGCAAA 2820

TCCCTGCTGG TAAAGCTACA CCAGCTCCAC CGGTTGGACC TGCTCTTGG 2869 
(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6186 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131:

CTGAATCCCT TATAGGAGTC CAGTAACTTT TTAGCCTCTA CTTTGCCTTC ATAGGCAGCT 60

TCAACATCAT TAAAAAAAGA ArGCACTGAA GCAAGTTCTT CAGTGCTCCA CGACAAATCT 120

AGTGGGTAAC TATACTGTTT GTTCATTAAC TAATACCAGC TCTCATTCTT GCTTCTTTTA 180 

GTTCTTGCTT ACGATAACTA CGAGGGAGAA AAGCACGAAT CTCATCTTCA TTAAAACCGA 240

TTTGCATACG CTTGGCATCA ATAATAATTG GACGACGCAA AAGACTAGGA TACTGCTCAA 300 

TCAAATGAAG CAATTCCGAT ACCGAAATAC TCTCTACATC AATATTCAAT TTTTGAAAAA 360 

TTTTTGAACG AGTTGAAATG ATGTCATCAG TACCATTTTC GGTCAAGGAA AGGATGTGTT 420 

GCAATTCTTT TCTTGTTAAA GGACTGGTCA TAATATTGTG TTCCACAAAG GGAACTTATG 480

TTTTTCTAAC CAGGCCTTAG CCTTACGACA TGATGTACAG CTCGGTGATA GAAATAGTGT 540 

AATCATGCTT TTCTCTTCTT ATCTATACTT TGCTACTTCT ATTATACAAA AAAATAAAGC 600 

GCTTGACTAG GGATTTTTAG AAAAAAAGCC TATTTTTTCA AGAAAAATAG GCTTTTTGCG 660 

AACGATTGAC ACAATTGGAT TTGGTTAATT CACTCTTAAC GATGGTTTTA AACGATATAT 720 

ATTTTTATAT ATGTAAATTA AAAACATCTT TCCTTTCACT TCCTACGACT TTTCAGATAC 780 

AGATAGCCAA AGAAGTTTTC ATAGAGGGCA AAAAAGAGGA GGAAGGCATG AAGAAAGAAG 840 

GTCTCTGGCA AAATCATAAT AACAGGATCC TTGGCTGGAT CAAAAAGCCA GGTATCATCT 900

CCCACAAAGA GAATTTGATG GAAAAGAGTA AAGAATTGGT CAAAACCAAT CAAAACTCCC 960 

CCAAGTCCAA TCATCACAGG TAAGACTACT AGAGCCAGGA GACTTTTTCG ATAAAGAGAC 1020 

AAAAAGTCCT TTTTCACAAT CCTATTGACA AAGACATAGA AACTTGGCAG TGTCACTAGA 1080 

GCTACTAGCT GAACCAAATG AAAGAGATTC TTGACCACTG CGAAATGGTG CAGACCAGCT 1140 

GCTGACGAAC GAAAATCAGG CATCTGTAAG ACCTGACTAA AAGGATTGGT CAGATAATTC 1200 

ATCAAGATAT GAAAATTGTA TTGAATGGTT TCTGGTTTTA GATAGACTCG ATTCGTTAAG 1260 

TTTAGCCACT GAATCTCCAT AGGATAGAAA ATCCAAGCCA GATAAATGGT CAGAAGGATG 1320

GAGAGGGAGA GGAGAAAGAG CATAGAGCCC CAAAAGATCA ATTTAGTTTT CATCAAAATC 1380 

CCACTCCGCA AGGCTAGAAA CCACATGTGT CGGTGCGATT GGCAGGCCAG CTACTTCTTC 1440 

TGCCTTAGTA AAACCTGTCG TCACCAAGAG CGTTGGAATG CCATTGTCAA TCCCAGCCCG 1500 

AATATCAGTC AAATAATTGT CCCCAACCAT GATTAACTCT TCACGTTCCA AACCTAAGTG 1560 

CTCAACCGCC TTGTCCATAA TGATGGCATT TGGTTTTCCG ATATAAACCG GCTTCACTCG 1620

TGTCGCTACT TCAAGCAGCG TAATCAGTGA GCCAGCACCT GGCAAAAGAC CGCGTTCCGT 1680 

CGGGATGTTG AGGTCAGGAT TGGTTCCGAT AAAATGGGCA CCCTTTTGAA TAGCAAGAGT 1740 



